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LEADERSHIP BEHAVIOR ASSOCIATED WITH 
THE ADMINISTRATIVE REPUTATION OF 
COLLEGE DEPARTMENTS 


JOHN K. HEMPHILL 


The Ohio State University 


I, INTRODUCTION 


The effectiveness of administration within educational institu- 
tions is rarely assessed in a systematic or formal manner. Conse- 
quently, in research on problems of educational administration, 
the staff of the Ohio State Leadership Studies' has looked for 
methods for making reasonable estimates of the administrative 
effectiveness. This paper is concerned with characteristics of the 
“reputation for being well administered” as a criterion of the 
quality of administration in college departments. 

Opinions of his peers, co-workers or ‘“‘buddies’”’ about the quality 
of an individual’s performance have been demonstrated to provide 
useful criterion data (11). The question of whether analogous 
opinions can be found that will reflect the quality of educaticnal 
administration is suggested immediately. Are departments of a 
college characterized by a reputation for their administrative ex- 
cellence? If such reputations exist do they reflect valid differences 
in administrative practices? One part of a more comprehensive 





‘The Ohio State Leadership Studies, initiated in 1946 by Carroll L:- 
Shartle, have been planned as a ten-year program of research on leadership 
and administration problems in military, business, industrial, educational, 
and civilian governmental organizations. The research reported here was 
supported by a grant received from the Rockefeller Foundation for the 
study of administration in education. Melvin Seeman and Andrew W. Hal- 
pin, as members of the staff of the Ohio State Leadership Studies, have 
conducted related studies of public school administration. 
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study of the departments within a single Liberal Arts college pro- 
vides some evidence to support the proposition that the “reputation 
for being well administered”’ is useful criterion information. 


II, PROCEDURE 


The Study: This report concerns only a portion of a larger study 
of the leadership and administration of 22 college departments in 
a moderately large mid-western university.? The larger study had 
as its general purpose, the testing of the applicability of research 
methods being developed by the Ohio State Leadership Studies to 
the university situation. These methods consisted of series of 
questionnaires and rating forms which had been used previously in 
military (3), business (1) and public school systems (8). 

Codéperation necessary for the study was first secured from the 
dean of the Liberal Arts college and the 22 department heads. 
Later all faculty members were informed of the nature and objec- 
tives of the study by means of (a) a letter of introduction signed by 
the dean, (b) personal meetings with each department’s faculty 
and (c) written instructions for completing a packet of question- 
naires to be submitted anonymously. Although participation was 
voluntary, 234 of the 322 faculty members returned their question- 
naires after two follow-up letters. 

The questionnaires were distributed in four types of packets 
made up as follows: 

Type A: (These questionnaires were distributed to the 22 depart- 
ment heads) 

1) Background Information Questionnaire 

2) Leader Behavior Description Questionnaire (self description 

* directions) 

3) Responsibility, Authority and Delegation scales (9) 

4) Reputational Ranking Form 
Type B: (These questionnaires were distributed to a randomly 

selected third of the members of each department’s 
faculty) 

1) Background Information Questionnaire 

2) Leader Behavior Description Questionnaire (General form 

with directions to describe the actual behavior of the faculty 
member’s department chairman) 





2? Vernon Jon Bentz, Psychologist, Sears, Roebuck and Co. assisted in 
planning this study, and collected the data. 








Leadership and Administration of College Departments 387 


3) Group Dimensions Description Questionnaire 

4) Reputational Ranking Form 
Type C: (These questionnaires were distributed to a second ran- 

domly selected third of the members of each depart- 
ment) 

1) Background Information Questionnaire 

2) Leader Behavior Description Questionnaire (General form 
with directions to describe the behavior of an “‘ideal’’ leader 
of the faculty member’s department) 

3) Leader Behavior Description Questionnaire (Specific form, 
with directions to describe the actual behavior of the faculty 
member’s department chairman) 

4) Reputational Ranking Form 

Type D: (These questionnaires were distributed to a third ran- 
domly selected third of the members of each department’s 
faculty) 

1) Background Information Questionnaire 

2) Status Position Scales* 

3) Status Perception Scales* 

4) Group Dimensions Questionnaire 

5) Reputational Ranking Form 

The four-way division of the work of completing the large bat- 
tery of forms and questionnaries was necessary in order to reduce 
the formidability of the requests made of each faculty member. 

Only the Background Information Questionnaire, the Group 
Dimensions, the Leader Behavior Description Questionnaire, 
(General Form) and the Reputational Ranking Form are of im- 
mediate interest in this paper. Each of these four questionnaires is 
described briefly in the following paragraphs. 

Background Information Questionnaire: This form secured data 
concerning the respondent’s department affiliation, age, sex, aca- 
demic rank, degree held, length of time in his department, length 
of association as a member of an academic faculty, publications, 
affiliation with other professional organizations and general job 
satisfaction. 

The Leader Behavior Description Questionnaire: This question- 
naire is the product of the joint efforts of the staff of the Ohio 
State Leadership Studies. Its development is described by Halpin 
(3), and Fleishman (2). The form used in this study consisted of 





* Seeman (7) (8) for a description of the use of these scales. 
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150 items; however, only 30 of this number were actually scored. 
This decision was based upon an extensive factor analysis of all 
the items. Two scores are obtained: (a) Consideration and (b) 
Initiating Structure. 

Consideration refers to leader behavior that is characterized by 
warm friendly relations with group members and concern with 
group member welfare. The following items are typical of this 
dimension: 

He looks out for the personal welfare of individual members. 

He treats all members as his equal. 

He puts suggestions by the group into operation. 

He gets group approval on important matters before going 
ahead. 

Initiating Structure refers to leader activities that introduce 
organization, new ways of doing things, and new procedures for 
solving group problems, etc. The following are typical Initiating 
Structure items: 

He lets members know what is expected of them. 

He encourages the use of uniform procedures. 

He asks that members follow organizational lines. 

He tries out his new ideas in the group. 

The respondent indicates the frequency with which his leader 
engages in each of these behaviors by selecting one of five adverbs: 
always, often, occasionally, seldom or never. 

The Group Dimension Questionnaire: The development of this 
questionnaire is described in detail by Hemphill and Westie (6). 
It yields scores on 13 dimensions or characteristics of groups: 

1) Autonomy is the degree to which a group functions inde- 
pendently of other groups and occupies an independent position 
in society (13 items). 

- 2) Control is the degree to which a group regulates the behavior 
of individuals while they are functioning as group members (12 
items). 

3) Flexibility is the degree to which a group’s activities are 
marked by informal procedures rather than by adherence to es- 
tablished procedures (13 items). 

4) Hedonic Tone is the degree to which group membership is 
accompanied by a general feeling of pleasantness or agreeableness 
(5 items). 

5) Homogeneity is the degree to which members of a group are 
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similar with respect to socially relevant characteristics (15 items). 

6) Intimacy is the degree to which members of a group are 
mutually acquainted with one another and are familiar with the 
more personal details of one another’s lives (13 items). 

7) Participation is the degree to which members of a group apply 
equal effort to group activities (10 items). 

8) Permeability is the degree to which a group permits ready ac- 
cess to membership (13 items). 

9) Polarization is the degree to which a group is oriented and 
works toward a single goal which is clear and specific to all mem- 
bers (12 items). 

10) Potency is the degree to which a group has primary signifi- 
cance for its members (15 items). 

11) Stability is the degree to which the group resists changes in 
its size and in turnover of its members (5 items). 

12) Stratification is the degree to which a group orders its mem- 
bers into status hierarchies (12 items). 

13. Viscidity is the degree to which members of the group func- 
tion as a unit (15 items). 

The respondents (faculty members) describe their departments 
by indicating how well the items assigned to each dimension 
characterize their respective departments as groups. 

The Reputational Ranking Form: This form is based on the 
methodology of the “nominating technique” (6), (10). The form 
was prefaced with these directions: 

“Tt is desirable to ascertain an estimate of departmental ‘reputa- 
tion’. Not including your own department, rank the five departments 
in the Liberal Arts College that have general reputations on the 
campus for being ‘Best Led’ or ‘Best Administered’ departments. 

“You are not asked to give you own personal evaluation, but 
asked to give the general reputation of the department, using as 
your point of reference what ‘most people in the College of Liberal 
Arts would agree’ that a given department has as a given reputa- 
tion. For example, you may personally think a certain department 
is poorly led but know that the reputation of that department on 
the campus is generally higher than your own personal opinion 
would rate it. In making the ranking use this general campus 
reputation.” 

Similar instructions also were given with a request for the five 
departments that were reputed to be “Least Well Led” or “Least 





EE 
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Well Administered.” With completion of the form the respondent 
had indicated his knowledge of the campus reputations of ten 
departments and by implication had placed the 12 remaining de- 
partments in a category not distinguished by reputations of being 
either well administered or poorly administered. 


Analysis of Data 


The analysis of data was planned in a manner to provide tenta- 
tive answers to many questions that are raised when reputational 
information is suggested as a criterion of quality of administration. 
Among these questions are: 

1) Is there any degree of consistency or agreement among the 
individuals who report knowledge of reputations of the different 
departments? 

2) Are these reputational rankings made by individuals qualified 
to know existing campus opinions? 

3) What characteristics of a department or of the behavior of 
its chairman are reflected in the department’s campus reputation 
with respect to quality of administration? 


Scoring of the Reputational Rankings 
A relatively arbitrary scoring procedure was adopted for the 
purpose of summarizing the data secured by the reputational 
ranking forms. Each nomination received by a department was as- 
signed the weight according to the following scheme: 


Nomination Weight 


First Best Administered 5 
Second Best Administered 4 
Third Best Administered 3 
Fourth Best Administered 2 
Fifth Best Administered 1 
First Less Well Administered —5 
Second Less Well Administered —4 
Third Less Well Administered —3 
Fourth Less Well Administered —2 
Fifth Less Well Administered —1 
Not nominated for either category 0 


The score derived for each department was the sum of the 
weights assigned to the nominations that were received. Scores 
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TaBLE I—DEPARTMENTAL ADMINISTRATION ‘‘REPUTATION ScorEs”’ 
DERIVED FROM Two MatcHep SAMPLES OF RESPONDENTS 
AND FROM THE ToTAL NUMBER OF RESPONDENTS 














Sample A (N = 33) Sample B (N = 33 ) Total Sample (N = 133) 
Dept. Scores Dept. Scores Dept. Scores 
A 9 A 2 A 16 
B —29 B —30 B —83 
C 47 C 80 C 203 
D —46 D —40 D —112 
E 49 E 73 E 223 
F —9 F —18 F —14 
G —53 G —37 G —138 
H 16 H 9 H 31 
I 45 I 19 I 100 
J —41 J — 54 J —122 
K 89 K 72 K 252 
L —8 L —5 L —24 
M 0 M 0 M 4 
N 101 N 68 N 305 
O —35 O — 26 O —87 
P —74 P —70 P —271 
Q ~1 Q 9 Q 13 
R —29 R —12 R —53 
8 —27 8 —3l1 8 —80 
T 19 T —2 = 25 
U —17 U —12 U — 56 
V 3 V 1 = 3 




















ranged from —271 to 305. The distribution of the scores for the 22 
departments is given in the third column of Table 1. 


III, RESULTS 


The presentation of the results of the analysis of the data will 
be organized in respect to the three major questions that were 
asked about the characteristics of ‘“‘reputation for being well 
administered” as a criterion of administrative effectiveness. 

What is the degree of consistency or agreement among the reputa- 
tions reported for the various departments? This question was an- 
swered by establishing two samples of respondents and determin- 
ing the agreement between the two sets of reputation scores derived 
from the two samples. The two samples of respondents included 





— ——————————— a ie 
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only those who had made relatively complete nominations (8, 9 
or 10, of the five best administered and the five less well adminis- 
tered) and was constituted so as to have an approximately equal 
number of respondents from the various departments in each 
sample. Table 1 shows the “reputation scores” of the department 
as derived from the two samples. The correlation between the two 
sets (Sample A vs. Sample B) of “reputations” scores is .94. If 
the Spearman Brown prophecy formula is applied to this coeffi- 
cient, we may estimate the reliability of these reputation scores as 
97. 

We, therefore, can have considerable confidence in the existence 
of a campus reputation with respect to the quality of depart- 
mental administration. Evidently, this reputation is known and 
can be reported with a high degree of agreement among members 
of the college faculty. 

What are the characteristics of those faculty members who com- 
pleted the Reputational Ranking Form? Not all of the faculty mem- 
bers who coéperated in the study by returning questionnaires com- 
pleted the Reputational Ranking Form. Many faculty members 
indicated on the face of the form that they were unaware of 
campus opinion about the administrative reputation of departments. 
Others made only a portion of the ten nominations requested. It 
was possible to establish two groups of respondents on the basis 


TABLE 2—AGE or RESPONDENTS COMPOSING (a) A GROUP OF 
SEVENTY-THREE WHO RETURNED THE REPUTATIONAL RANKING 
Form CoMPLETED AND (b) A Group oF ONE HUNDRED AND 
ONE WHO RETURNED THE ForM BLANK 








Age Returned Completed Forms Returned Blank Forms 
(N = 73) (N = 101) 

Over 70 2 2 
65-70 5 2 
60-64 6 4 
55-59 8 8 
50-54 16 8 
45-49 14 6 
40-44 10 26 
35-39 7 18 
30-34 5 26 
25-29 0 0 

Under 25 0 1 
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TaBLE 3—LENGTH OF TIME A MEMBER OF THE COLLEGE FACULTY 
OF RESPONDENTS COMPOSING (a) A GrRouUP OF 73 wHO 
RETURNED COMPLETED Forms AND (6b) a Group 
oF 101 wHo RETURNED BLANK Forms 








Length of Faculty Membership mainte is Forms « = tn Forms 
Over 6 years 44 25 
Four to 6 years 5 5 
Two to 4 years 15 31 
One to 2 years 7 21 
Six months to 1 year 2 15 
Less than 6 months 0 4 











of their responses to the Reputational Ranking Form. Group I 
consisted of 73 faculty members who provided at least eight of 
the ten nominations requested. Group II consisted of 101 faculty 
members who completed the other questionnaires in their packets 
but failed to make a single nomination on the Reputational Rank- 
ing Form. A comparison of data available on the characteristics of 
these two groups of respondents indicated that those faculty 
members who provided nominations differed from those who did 
not in that they were considerably older and, what is perhaps more 
important, had been members of the faculty for a longer time. 
(Tables 2 and 3). 

From Table 2 we note that 51% of the respondents who re- 
turned completed forms were over 50 years of age. This compares 
with 24% of the respondents who returned blank forms.‘ 

Referring to Table 3, we see that 60% of the respondents who 
returned completed forms had been members of the faculty 6 years 
or more. This compares with 25 % of the respondents who returned 
blank questionnaires.® 

It is to be expected that campus reputations of college depart- 
ments are known through long associations of faculty members. 
Individuals who are relatively ‘“‘new” on the campus are likely to 
be unaware of events that were significant in establishing a de- 
partment’s reputation with respect to its administrative excellence. 





‘ The difference between the means of the two groups, (47.8 and 40.5) is 
significant at the .01 level (t = 4.8). 

* The difference between the means of the two groups (4.30 and 2.72) is 
significant at the .01 level (t = 4.7). The first three categories of Table 3 
were combined to make this test. 
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We interpret the tendency of more mature (older) and better 
qualified (a member of the faculty for a longer length of time) 
faculty members to complete this form to indicate a face validity 
of the information we obtained. We also must accept the fact that 
judging campus reputation of departments may be a meaningless 
and impossible task for a large portion of the younger faculty. 

Is the reputation of departments for being well administered re- 
lated to the style of the department chairman’s leadership? The packets 
of questionnaires that were distributed to the members of each 
department included the Leader Behavior Description Question- 
naire, General Form. One third of the members of each department 
completed this questionnaire to provide their descriptions of the 
leadership behavior of the chairman of their respective depart- 
ments. A second third of the members of each department com- 
pleted this same questionnaire but with instructions to indicate 
what an “ideal” chairman for their department should do. For 
18 of the 22 departments descriptions were available of the “‘ac- 
tual’ and the “ideal” behavior of the chairmen from at least 2 and 
as many as 17 department members. For each of these 18 depart- 
ments, an average Consideration score and an average Initiating 
Structure score were calculated. Table 4 presents the average 
scores for both “actual” and “ideal” leadership behavior and the 
“discrepancy” between these sets of scores. 

Pearson product-moment correlation coefficients were computed 
to express the relationship between each of the two dimensions of 
actual leadership behavior and the “reputation” score of the de- 
partments. Correlations were also computed to express the relation- 
ship between the absolute discrepancy (either positive or negative) 
between “actual” and “ideal” leadership behavior on the two di- 
mensions and the “reputation” score. These data are presented in 
Table 5. 

By reference to Table 5 it can be seen that there is some relation- 
ship between the style of leadership of the department chairman 
as this is viewed by department members and the department 
reputation on the campus for being well or poorly administered. 
This is especially true if we view the departure of the actual be- 
havior of the department from what is held by members of his 
department to be ideal behavior. Departments that achieve a 
reputation for good administration are those departments led by 
chairmen who attend to both of the facets of leadership measured 
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TaBLE 4—AVERAGE ScoRES ON CONSIDERATION AND INITIATING 
STRUCTURE OF DEPARTMENT CHAIRMEN AS DESCRIBED 
BY THE DEPARTMENT MEMBERS 

















Consideration Initiating Structure 
Department 
Ideal Actual oe Ideal Actual  ... cy 
A* 44.0 45.3 1.3 44.7 42.7 2.0 
B 51.0 42.0 9.0 46.5 30.7 15.8 
C* 49.3 48.1 1.2 45.5 43.5 2.0 
D 43.0 24.3 18.7 34.3 33.0 1.3 
E* 46.4 42.1 4.3 36.9 36.7 2 
F 45.5 38.3 7.2 44.8 44.5 3 
G* 53.0 35.0 18.0 50.0 26.5 13.5 
i 47.8 49.6 1.8 41.8 49.2 7.4 
J 45.5 49.5 4.0 42.0 25.5 16.5 
K* 47.3 41.1 6.2 38.4 38.9 5 
L 47.3 40.0 7.3 42.7 32.5 10.2 
N* 49.0 44.8 4.2 39.6 37.8 1.8 
O 49.3 43.7 5.6 47.0 44.3 2.7 
Q* 48.0 | 47.0 1.0 | 36.3 | 39.0 2.7 
R 50.3 38.3 12.0 40.8 34.0 6.8 
8 50.0 34.0 16.0 40.5 25.0 15.5 
= 46.6 49.2 2.6 39.7 40.2 .5 
U 51.0 43.2 7.8 38.8 35.8 3.0 

















* Above the median on “reputation.” 


in this study, i.e., they concern themselves with (1) organizing 
departmental activities and initiating new ways of solving depart- 
mental problems, and at the same time with (2) developing warm 
considerate relationship with members of the department.® 

If we examine the leadership patterns of department chairmen 
closely we note a conjunctive relationship between Consideration 
and Initiating Structure suggesting that a minimal amount of both 
types of behavior are required for the acquisition of a good repu- 
tation and that an excess of one type of behavior does not com- 
pensate for the lack of the other. An excess of Consideration be- 
havior cannot compensate for a deficiency in Initiating Structure 
and vice versa. If a cutting score of 41 on Consideration and 36 
on Initiating Structure are applied simultaneously, the reputation 





* This finding is congruent with the results of a similar study conducted 
within the Air Force and reported by Halpin (3). 
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TABLE 5—CORRELATION COEFFICIENTS EXPRESSING THE RELATIONSHIP 
BETWEEN THE ADMINISTRATIVE ‘“‘REPUTATION’’ SCORES OF 18 
CoLLEGE DEPARTMENTS AND Four LEADERSHIP SCORES 
OF THE DEPARTMENT CHAIRMEN AS DESCRIBED BY 
DEPARTMENT MEMBERS 

















Score Mean S.D. a Se and 
Reputation 21.7 133.8 _- 
‘*Actual’’ Consideration 42.2 6.4 .36 
Discrepancy between ‘‘Actual’’ and 6.9 5.6 — .52* 
‘Ideal’? Consideration 
‘**Actual’”’ Initiating Structure 36.8 6.7 .48* 
Discrepancy between ‘‘Actual’’ and 6.3 6.9 — .55* 
‘*‘Ideal”’ Initiating Structure 





* A correlation coefficient of .47 is significant at the .05 level. 


of the department in terms of being above or below the median 
are successfully identified in 16 out of 18 cases. (These cutting 
scores were selected by inspection of the data and may have no 
specific significance.) The data for this comparison are given in 
Table 6. 

The Chi squared value for the data of Table 6 is 8.00, signifi- 
cant beyond the .01 level of probability. 

A further means of examining the conjunctive relation between 
these two facets of leadership behavior is available for the dis- 
crepancy score. The total of the two discrepancy scores of depart- 
ment chairmen correlates —.63 (.01 level of probability) with the 


TABLE 6—THE RELATIONSHIP BETWEEN THE REPUTATION ACHIEVED BY 
CoLLEGE DEPARTMENTS AND THE CONSIDERATION AND INITIATING 
STRUCTURE ScorRES OF DEPARTMENT CHAIRMEN TAKEN 








CoNJUNCTIVELY (N = 18) ' 
Chairman’s Leadership 2 ee ny 
Departments 
Score of 41 or larger on Consideration and | A, C, E, I, O 
a score of 36 or more on Initiating Struc- K,N,Q,T 
ture 
Score of less than 41 on Consideration or a F B,D,G, J,L, 
score of less than 36 on Initiating Struc- R, 8, U 
ture 
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reputation scores. These last two reported results suggest strongly 
that if department chairmen meet the expectations of their fac- 
ulties with respect to both Consideration and Initiating Structure 
the department is more likely to achieve a favorable reputation 
on the campus. 

In the remainder of the presentation of results we shall examine 
characteristics of the departments for relationship with their ad- 
ministrative reputation. 

Are the characteristics of their department as seen by department 
members related to the reputation of their department? The Group 
Dimensions Questionnaire was distributed to two thirds of each 
department’s members. The questionnaire provides a score on 
each of 13 group characteristics (see page 388). This score was 
calculated as the average of the descriptions supplied by those de- 
partment members who completed this questionnaire. Average 
scores based on a minimum of two member’s descriptions were 
available for 18 of the 22 departments. Pearson correlation coeffi- 
cients were computed for each of the 13 dimensions and the repu- 
tation score. The correlations are presented in Table 7. 

None of the correlations reported in Table 7 is statistically sig- 
nificant. There appears to be no relationship between the manner 
in which department members characterized their respective de- 


TABLE 7—CORRELATION BETWEEN Eacu OF THIRTEEN Group DIMENSIONS 
(AVERAGE ScoRES) AND DEPARTMENTS ‘‘REPUTATION”’ FOR 
Bering WELL ADMINISTERED (N = 18) 

















Correlation: Reputation vs. Mean S.D. - 
Autonomy 34.4 4.0 .09 
Control 28.7 3.4 — .07 
Flexibility 37.6 5.8 .02 
Hedonic Tone 18.8 2.1 .16 
Homogeneity 33.1 3.7 — .05 
Intimacy 48.4 4.4 — .02 
Participation 46.4 4.5 .29 
Permeability 22.5 4.3 — .04 
Polarization 38.4 5.6 .05 
Potency 51.1 4.6 .05 
Stability 15.1 2.8 — .06 
Stratification 40.7 5.1 .05 
Viscidity 35.5 8.1 .04 





* An r of .47 is required for significance at the .05 level. 
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partments by use of the Group Description Questionnaire and the 
“reputation” of the departments for being well administered. This 
lack of relationship highlights the significance of the earlier re- 
ported relationship between the behavior of the department chair- 
man and administrative reputation. It points to chairman be- 
havior specifically, rather than general impressions about the 
department as the more likely determiner of reputation. 
Are there relationships between “‘demographic”’ characteristics of 
departments and their reputation? Examination was made of each 
of the following “demographic” characteristics of the 22 depart- 
ments and their reputation scores: 
1. Age of faculty members (Average) 
2. Size of the department (Number of members) 
3. Length of service to the department of it’s faculty members 
(Average number of years) 

4. Length of academic experience of each department faculty 
(Average number of years) 

5. Academic rank of members (Per cent Associate Professors or 
Professors) 

6. Educational attainment of faculty of the departments (Per 

cent with Ph.D.) 

7. Publications record of department members (Index based on 

number and “quality” of publications) 

Only one of these seven “demographic” variables showed a re- 
lationship with reputation. It was found that large departments 
tended to receive higher reputation scores (r between number of 
members and reputation = .58, significant at the .05 level). Two 
hypotheses were advanced in an effort to account for this relation. 
First, it was suggested that larger departments might require more 
Initiating Structure behavior on the part of their chairmen and 
since Initiating Structure had been shown to be related to “repu- 
tation” the relationship with department size could be a function 
of the same underlying variables.’ The second explanation involved 
the suggestion that the significance of administration in larger as 
compared with smaller departments is well recognized by those re- 
sponsible for the selection of department heads and that in the 
case of larger departments more care had been exercised. In other 
words, in very small departments, chairmen might be appointed 





7 In previous research a relation between size of group and leader centered 
direction had been observed (4). 
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on the basis of seniority without much regard for other qualifica- 
tions, but in large departments demonstration of ability to assume 
administrative responsibility would be involved in a much larger 
measure in the selection of a chairman. 

The first of the two hypotheses could be checked in part by de- 
termining the relation between the size of the department and the 
Initiating Structure behavior of their chairman. This correlation 
was computed and found to be .11, which is not significant. The 
available evidence rejects the first interpretation. 


IV. SUMMARY AND DISCUSSION 


The campus reputations of 22 departments for being well ad- 
ministrated were secured from members of the faculty of a Liberal 
Arts college by use of a nominating form. As parts of the same re- 
search study data were also available concerning (a) the leadership 
behavior of department chairmen as viewed and as wanted by de- 
partment members, (b) group characteristics of departments as 
viewed by department members, and (c) demographic character- 
istics of the departments. An examination was made of the rela- 
tionship among these data in an effort to assess the usefulness of 
campus opinion as a criterion of quality of administration. The fol- 
lowing conclusions resulted from the analysis: 

1) Administrative reputation of the college department was re- 
liably reported by faculty members. Agreement between two in- 
dependent samples of respondents was very high (r = .94). 

2) Older and more mature faculty members provided a larger 
proportion of the reputation information than “‘younger”’ or “‘new”’ 
members of the faculty. In fact, it appears many new members of 
the college faculty were not aware of the departments’ reputations. 

3) Reputation for being well administered is related to the lead- 
ership behavior of department chairmen as this behavior is de- 
scribed by department members. Those departments with best 
reputations for good administration have chairmen who are de- 
scribed as above the average on both Consideration and Initiating 
Structure and who more nearly meet the behavior expected of an 
ideal chairman. It appears that optimal amounts of both of these 
types of behavior are required in order that a department earn a 
reputation for good administration. 

4) With the exception of size, all group characteristics of the 
departments, both demographic characteristics and those de- 
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scribed by means of the Group Description Questionnaire are un- 
related to reputation for good administration. 

5) Larger departments tend to have better administrative repu- 
tations than smaller departments. This fact is independent of the 
Initiating Structure activity of the chairman and may indicate 
only that more care is exercised in selecting chairmen of large de- 
partments. 

In conclusion, it appears that reputation may provide a cri- 
terion of excellence of administration. Many questions remain to 
be investigated concerning the limits within which such reputation 
may be useful. Certainly, qualified respondents must be found who 
have had opportunity to become acquainted with events and opin- 
ions bearing on quality of administration. However, in many or- 
ganizations a pool of qualified respondents may be available in the 
senior members of the groups. Our study involved only one college 
faculty. We know nothing about how differences between organiza- 
tions may affect the availability and quality of opinion regarding 
administrative reputation. However, in view of the dearth of tech- 
niques available for assessing quality of administrative perform- 
ance and importance of the problem, the suggestion of the useful- 
ness of the reputation criterion reported here should be followed 


up. 
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THE DEVELOPMENT OF SHORTER AND 
MORE USEFUL SELECTION TESTS 


A. PEMBERTON JOHNSON 


Testing and Guidance Division, Newark College of Engineering, Newark, 
N. J., formerly of Educational Testing Service, Princeton, N. J. 


This article describes what has been done within the last few 
years to shorten two selection tests of proved validity. Any one 
might have followed a very similar procedure for shortening either 
of these tests. The technique followed with the second test is, I 
believe, somewhat new. The two tests are: (1) the Pre-Engineering 
Inventory shortened to become the Pre-Engineering Ability Test 
and (2) the 6-hour Law School Admission Test shortened to its 
present 314 hour version. 


SHORTENING THE PRE-ENGINEERING INVENTORY 


The Pre-Engineering Inventory (PEI) was first used in 1943, by 
the Measurement and Guidance Project in Engineering Education, 
then conducted by the Graduate Record Office of the Carnegie 
Foundation for the Advancement of Teaching. Designed primarily 
as a guidance instrument, which could be used for selection, the 
PEI provided seven sub-test scores and a composite of three of 
those sub-test scores. The composite was intended to provide a 
single index most predictive of 1st term engineering college averages 
and was quite effective for that purpose. By July 1949 when I be- 
came Project Director of the Measurement and Guidance Project 
evidence had begun to accumulate that many of the users found 
this test too expensive and too time-consuming, both in the 6-hour 
Long form and the 4-hour Short form. During 1949-50 a number 
of discussions were held with users to define the desirable features 
of a shortened test. In general the users wanted a test as valid as 
the Pre-Engineering Inventory or nearly so but requiring no more 
than 90 minutes testing time, a test which was reliable enough for 
individual guidance yet inexpensive and easy both to administer 
and to score. The users wanted, also, a means of converting scores 
on the new test to scores on the Pre-Engineering Inventory, so 
that their past experience with the earlier test would not be lost. 
One user wanted two scores: one on comprehension of scientific 
reading and one on mathematical ability. 


402 








Development of Shorter Selection Tests 403 


Information available to aid us in revising the test included: 

(1) Product-moment validity coefficients of sub-test and of com- 
posite scores against first-term grades at 12 engineering colleges 

(2) Product-moment intercorrelations of sub-test scores 

(3) Sub-test and composite score estimated product-moment 
reliability coefficients 

(4) Item validity coefficients at one large engineering college 

(5) Speededness data. 

Examination and manipulation of these data showed that raw 
scores on the two sub-tests, Comprehension of Scientific Materials 
and General Mathematical Ability, when added, would predict first 
term engineering college grades almost as well as the Composite 
score which is derived from these two sub-tests combined with a 
third one, Technical Verbal Ability. Median validities of these two 
sub-tests at 12 engineering colleges were: for Comprehension of 
Scientific Materials, .50; and for General Mathematical Ability, 
.58. The median intercorrelation was .70. The multiple R for the 
two in predicting first term grades based on the above data was .59 
as compared with .60 for the Composite score based on three parts. 
Odd-even raw score reliabilities for these sub-tests were: .88 for the 
100 Scientific Comprehension items and .89 for the 90 Mathematics 
items. Time limits were: Scientific Comprehension, 90 minutes, and 
Mathematics, 70 minutes. 

Estimated validity and reliability of single test combining half- 
length portions of the two sub-tests were: validity .57, reliability 
.90. These values were believed to justify use of the shortened test 
for selection and guidance of individual students. Study of the 
item validity data revealed 10 items in each sub-test with item 
validities of essentially zero. Study of the speededness data sug- 
gested that slightly more time should have been provided for both 
sub-tests in order to make them essentially power tests. 

In view of the above information, it was decided to produce 
from the 90 better Scientific Comprehension items and the 80 bet- 
ter Mathematics items two essentially half-length experimental 
forms equivalent as possible, allowing somewhat more time for 
both item types and retaining each type in a separately timed part. 
Thus two 80-minute forms comprising 45 Scientific Comprehension 
items in 45 minutes and 40 Mathematics items in 35 minutes were 
prepared. They were administered during the September 1950 
Orientation week in 1-2 and 2-1 order to random halves of a large 
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entering engineering freshman class, N = 610. The previous year’s 
class had taken the Pre-Engineering Inventory. Both classes took 
the College Board SAT and Physics Achievement Tests which 
were used as means of equating scores on the experimental tests 
to scores on the PEI. One of the experimental forms was published 
in July 1951 as the Pre-Engineering Ability Test. The manual re- 
ports a Kuder-Richardson formula 20 reliability of .90 for 305 
engineering freshmen and an estimated average validity of .57 
against first term grades. A study at Carnegie Institute of Tech- 
nology by Dr. Roland C. Moore reports a validity coefficient 
against first semester weighted averages there of .68 for 260 stu- 
dents to whom the 80-minute Pre-Engineering Ability Test was 
administered. This figure was no lower than correlations of the 
PEI against first semester averages for the PEI Composite in the 
previous years. The findings from use of the test appear to be con- 
sistent with the prior theoretical estimates of reliability and va- 
lidity. Thus it was possible to provide an 80-minute test with a 
single score virtually as valid as the composite score on the 4- to 
6-hour test, together with comparable scores for the earlier and the 
newer test. Two reliable part scores could not be obtained in 80 
minutes with the kind of test material used. I was responsible for 
the planning and arranging for the execution of this work which 
was supervised largely by Dr. William H. Angoff of our Statistical 
Analysis Department and Mr. Bernard S. Cayne of the Test De- 
velopment Department. 


SHORTENING THE LAW SCHOOL ADMISSION TEST 


The shortening of the Law School Admission Test (LSAT) pre- 
sented a somewhat different problem. The first two forms of that 
test comprised 10 to 12 sub-tests, the raw scores of which were 
summed for direct conversion to the single standard scores re- 
ported to users. Test analyses and validity studies of these first 
two forms showed odd-even coefficients of reliability for these un- 
speeded aptitude tests of the order of .95-.97, and suggested the 
possibility of shortening these forms without reducing validity 
coefficients of reported scores. These validity coefficients against a 
criterion of first year law grades, ranged upward to .68 with a 12 
school mean of .40 for the test as compared with .38 for pre-law 
averages. (The first year law school grade criterion has a greater 
judgmental and less mathematical basis than the first year engineer- 
ing college criterion.) 
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As a means of providing greater test security, utilizing available 
items efficiently, and of making greater funds available for the 
development and tryout of promising new materials, and for other 
services to users, it was decided to consider how best to develop 
four 34-hour tests from the earlier forms and from new pretested 
materials. Several staff discussions as to the best method of 
shortening the earlier forms of the LSAT led to plans for an em- 
pirical comparison of five selected methods applied to one form. 
The study was planned and executed under the direction of Dr. 
William B. Schrader and Dr. Marjorie A. Olsen who reported it at 
the 1952 meetings of the American Psychological Association. For 
convenience it dealt with reducing six sub-tests which required 
190 minutes testing time to a shortened test of 135 minutes, even 
though the original and present forms of the LSAT are both longer 
than these times indicate. 

The analysis was based on LSAT total and sub-test scores and 
on first year law school average grades for two random cross-valida- 
tion sub-samples of 500 from among 1000 first year students in 
nine law schools. In order to combine criterion data from all law 
schools first year grades were converted to a common scale; the 
regression of grades on LSAT total score for each school was used 
to determine the appropriate mean and standard deviation for the 
group on the common criterion scale. 

The necessary calculations and item selections were made for 
each of the five methods with data from each of the two sub- 
samples so that trial scoring keys could be used for cross-validation 
on the opposite sub-samples. 

Keys based on one sub-sample were then used to rescore the 
answer sheets for the other sub-sample, and the new trial scores 
by each method were correlated with the first-year law school 
grades. 

Keys based on three item-analysis methods were tried out: 

1) Internal consistency item analysis—items showing the highest 
biserial correlations with LSAT total score were selected for this 
trial key. 

2) External criterion item analysis—items having the highest 
biserial correlations with first-year law school grades were included 
in this key. 

3) E/I ratio, substantially that described by Gulliksen in the 
Proceedings of the 1948 Invitational Conference on Testing Prob- 
lems—items with the highest ratios of point biserial correlation 
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with grades (external criterion-E) to point biserial with sub-test 
score (internal criterion-I). 

Keys for the two following methods, each concerned with esti- 
mating optimal lengths of time for sub-tests, were also explored: 

4) Horst’s method for condition III is described in his June 
1951 article in Psychometrika. This procedure takes into account 
the validity, reliability and original length of each sub-test and 
its intercorrelations with other sub-tests. Under the conditions 
that total time (or number of items) is fixed and the score is total 
number right, it determines the optimal time for each sub-test so 
that the validity of the total score is maximized. In this study it 
was not quite possible for some sub-tests to include the optimal 
number of items because the number required was greater than 
the number available. 

5) The linear method—In principle, this procedure is similar 
to Horst’s method, but by making certain assumptions, it leads 
to a markedly simplified solution. It takes into account the validity 
and original length of each sub-test and its intercorrelations with 
other sub-tests. It specifies an amount of time for each sub-test in 
the shortened form such that the raw score regression weights of 
all of the sub-tests are approximately equal. It assumes that, with 
moderate changes in the length of any sub-test, standard score 
regression weights for the sub-test scores change proportionately 
with length of time for the sub-test. In effect the latter assumption 
is equivalent to assuming sub-test reliabilities of 1.00. 

If these assumptions are accepted, the solution for optimal sub- 
test times follows readily. For any sub-test, its proportion of the 
total test time is simply the product of its standard score regression 
weight and its original time allotment divided by the sum of such 
products for all sub-tests. Within each sub-test, the items with the 
highest biserial correlations with the sub-test score are chosen to 
the extent required to meet the needed sub-test length. 

Surprising enough, within each sub-sample, the cross-validity 
coefficients for the five different methods were virtually the same. 
In one sample they ranged from .483 to .496, a difference of .013; 
in the other, from .569 to .576, a difference of .007. The simplest 
method by far, the linear method, had a slightly higher mean 
validity coefficient in the two samples than any other method. 

The linear method was applied to nine sub-tests in a second form 
of the LSAT using data for two other random sub-samples of 500 
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law students each. In the multiple regression equation for these 
nine sub-tests, four sub-tests showed appreciable beta weights and 
the remaining five showed negligible weights, when used with the 
other four. A trial 135-minute key, developed to provide optimal 
time allotments among these four tests and to use the most in- 
ternally homogeneous items of each sub-test showed a cross-validity 
coefficient of .466 as compared with .454 for the 300 minute original. 
This is a striking example of the effectiveness of the method. 

The findings of the linear method applied to the two earlier 
forms of the LSAT, together with tests and item analysis data 
from these forms, and a careful inventory of pre-tested and ana- 
lyzed items of each type made possible the development of a plan 
of assembly of four alternate 314-hour forms which maximize the 
validity of all. The data available strongly suggests that the four 
new forms will be at least as valid as the earlier long forms. A 
routine follow-up validity study on the first of the alternate forms 
is planned, however, for next fall when the first new criterion 
data should be available. Reliabilities of two of the new forms are 
.913 and .927. The four new forms include carefully devised sets 
of items enabling the equating of scores on one pair of the new 
forms to scores on the first long form and on another pair to those 
on the second long form. Linking items are provided to link the 
new forms across pairs. 

Planning and execution of the Law School Admission Test work 
described above has involved a number of the ETS staff, princi- 
pally, Dr. Ledyard R. Tucker, Drs. Schrader and Olsen of the 
ETS Statistical Analysis Staff and Dr. Edith Huddleston of the 
ETS Test Development Department. I have served as Project 
Director suggesting goals and aiding in planning to achieve the 
ends mutually agreed upon. 


DISCUSSION 


You may have noticed that although the details of methodology 
in shortening the engineering and law tests were different, the 
basic principles followed were somewhat the same. Initial shorten- 
ing was by elimination of sub-tests which added negligibly to total 
test validity. The length of each included sub-test was determined 
partly or wholly on the basis of multiple regression data. The items 
required for each sub-test were chosen by some appropriate item 
selection technique. 

















A COMPARATIVE STUDY OF VALUES AND 
CRITICAL THINKING SKILLS OF A GROUP 
OF EDUCATIONAL WORKERS 


JOSEPH C. BLEDSOE 


The University of Georgia 


INTRODUCTION 


Formal school experiences such as college courses frequently 
have as their objectives the development of certain attitudes, in- 
sights, and skills. Such courses sometimes make claims which may 
not be justified. It is admittedly somewhat difficult to assess some 
of the more important attitudinal and behavioral changes which 
may take as a result of such experiences. Among mature students, 
moreover, changes in attitudes and practices are more difficult to 
bring about since basic patterns have largely been established. 
This study reports an attempt to measure, at least partially, cer- 
tain value-patterns and critical thinking skills of a group of grad- 
uate students with particular reference to changes which are re- 
vealed following an intensive six-weeks course in research methods. 
Certain correlates of value scores and critical thinking scores with 
factors such as age, sex, and area of professional work are also 
presented. 


PURPOSES 


Specific purposes of the study, which may be roughly regarded 
as hypotheses, are outlined as follows: 

1) What are the patterns of values and critical thinking skills 
of this group of graduate students in education? Can a distinctive 
hierarchy of values for such a group be established? 

2) What changes, if any, in dominant life-value are brought 
about in a period of six weeks among a group of relatively mature 
educational workers? (The six weeks period is characterized by an 
intensive course in research methodology.) Does such a course 
which emphasizes practical or applied research result in a signifi- 
cant increase in interest in theoretical or practical values? 

3) What changes, if any, in critical thinking skills are apparent 
following such a course? Can improvement in these skills be brought 
about in such a short time? 
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4) Are there significant sex and age differences in values and in 
critical thinking skills? 

5) Are there significant differences in values and in critical 
thinking skills among three types of workers, viz., administrators, 
elementary teachers, secondary teachers? 

6) Are there significant correlations between value scores, criti- 
cal thinking scores, scores on subject-matter tests on research 
methodology, age and sex? 

Brief Description of Group Studied, Course Taught, and Tests 
Employed. The subject group consisted of 46 graduate students 
enrolled in a graduate course entitled ‘‘Methods of Applied Re- 
search in Education.” There were 22 men and 24 women; 16 per- 
sons were employed during the regular school year as administra- 
tors (mostly principals), 10 as elementary school teachers, and 20 
as high-school teachers. The group ranged in age from 25 to 51, 
with a mean age of 36 years and a standard deviation of seven 
years. In experience, the group ranged from two years to 26 years, 
with a mean of 11 years. This particular section, a summer-school 
group, was apparently representative of graduate students in 
education enrolled at the University of Georgia. 

The course in research methods, required of all graduate stu- 
dents majoring in education, has as its broad objectives the devel- 
opment of insights, skills, and competencies needed to become ef- 
fective producers and consumers of educational research, with 
special reference to applied research. Among the more detailed 
objectives are an increased awareness of the need for applying re- 
search methods to the solution of problems, a clear conception of 
applied research as a means of solving problems, some knowledge 
of the more widely-used methods and tools of research in educa- 
tion, increased competence in interpreting the results of research, 
and general understanding of what is usually involved in preparing 
the research report. Among the procedures employed are lecture- 
discussions, presentation by students of proposed projects, and 
critiques by instructor and students of studies reported in the 
literature. 

The inventory of values employed was the Allport-Vernon- 
Lindzey Study of Values, Revised (1951) Edition. The Study of 
Values is designed to measure the relative value of six basic in- 
terests in personality: the Theoretical, Economic, Aesthetic, Social, 
Political, and Religious. The classification is based upon Eduard 
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Spranger’s T'ypes of Men, a work which assumes that the person- 
alities of men are best known through a study of their values or 
evaluative attitudes. The scale consists of a number of questions, 
based upon a variety of familiar situations to which two alternative 
answers (in Part I) and four alternative answers (in Part II) are 
provided. In all there are 120 answers, 20 of which refer to each of 
the six values. Usual time required is approximately 20 minutes.! 

The instrument employed for determining critical thinking skills 
was the Watson-Glaser Critical Thinking Appraisal. This instru- 
ment is designed to provide problems and situations which require 
the application of some of the important abilities involved in 
critical thinking. The subtests of the Critical Thinking Appraisal 
are (1) inference; (2) recognition of assumptions; (3) deduction; 
(4) interpretation; and (5) evaluation of arguments. There are 
two forms available, each of which has 99 items; time required is 
approximately 40 minutes.’ 

In addition to these standardized tests, the author, as instructor 
of the course, gave a 116-item teacher-made test based on the 
subject matter of research methodology. This test was of the 
multiple-choice and alternative-response type, and represented the 
fifth revision (sixth form) of the test after use with preceding 


classes. 


PROCEDURES 


At the first class meeting, Form Am of the Watson-Glaser 
Critical Thinking Appraisal and the Allport-Vernon-Lindzey Study 
of Values were administered in one sitting. Students were later 
informed, upon their request, in confidential interviews of their 
individual percentile scores on the Watson-Glaser Test and the 
order or hierarchy of the individual scores on the Allport-Vernon- 
Lindzey Values inventory. The course proceeded in the usual 
manner as briefly described in a preceding paragraph. At the end 
of the six-week period, the subject matter test was given and the 
two standardized, instruments were administered, including the 
alternate Form Bm of the Watson-Glaser Critical Thinking Ap- 





1 Gordon W. Allport, Philip E. Vernon, and Gardner Lindzey. Manual 
of Directions, Boston, Houghton Mifflin Company, 1951. 

2 Goodwin Watson and Edward M. Glaser. Manual, Watson-Glaser 
Critical Thinking Appraisal. Yonkers-on-Hudson, New York, World Book 


Company, 1952. 














Values and Skills of Educational Workers 411 


praisal. Normative data such as sex, age, and area of professional 
work were recorded. Measures of central tendency, variability, 
and correlation were then computed on these data and test scores. 
Tests of significance involving differences in means, standard er- 
rors of the differences, and the “‘t”’ ratio of significance were em- 
ployed. 


ASSUMPTIONS AND LIMITATIONS 


The following assumptions and limitations of the study should 
be mentioned: 

1) It is assumed that the two standardized instruments em- 
ployed have at least some validity for the purposes of this study. 
The validity of the findings is limited to a large extent by the 
validity of the instruments employed. 

2) The study was primarily exploratory. No control group was 
employed to determine what growth or changes may have taken 
place in activities other than experiences of this particular type; 
generalization about cause-and-effect relationships is therefore 
somewhat limited. 

3) It is assumed that these value patterns and thinking skills 
are important outcomes, and that students are sufficiently con- 
cerned that they give accurate responses. 

4) Most students in the subject group were taking one addi- 
tional course during this period; this course may have been of the 
same or of a different type, which may have augmented or de- 
tracted from changes produced by the experiences of the research 
methods course. 


SUMMARY OF FINDINGS 


Table I presents mean scores for both the beginning and the 
end-of-course administrations of the two standardized inventories, 
along with standard deviations and differences in means. Differ- 
ences in value scores from pre-test to end-test are very small and 
none of them are significant; largest difference comes in the rela- 
tively unstable ‘social’? subscale. The increase in mean score on 
the Critical Thinking Appraisal scale of approximately four points 
meets the five percent level of confidence test. Since the course 
had as one of its aims that of an increased awareness of the need 
for a critical attitude toward problems, it appears reasonable to 
conclude that students may have sharpened some of these skills. 
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TaBLE I—MEAN Scores, STANDARD DEVIATIONS, AND DIFFERENCES IN 
MEANS OF SCORES ON ALLPORT-VERNON-LINDZEY StTuDY OF VALUES 
AND WaTsoN-GLASER CRITICAL THINKING APPRAISAL 




















Mean Scores Standard Deviations DiSerences 
End-Minus 
Pre-Test | End-Test | Pre-Test | End-Test Pre-Test 
Theoretical 41.20 41.09 6.87 9.43 —.1l 
Economic 41.09 41.65 7.11 7.39 .56 
Aesthetic 35.35 35.39 7.05 7.67 04 
Social 38.11 36.80 5.62 6.30 —1.31 
Political 37.70 38.83 5.54 7.40 1.13 
Religious 46.41 46.65 8.64 7.28 24 
Critical Thinking Ap- | 63.28 67 .32 12.71 10.58 4.04* 
praisal 











* Significant at five per cent level of confidence. 


It also appears fairly certain that a short course of this type is 
not likely to produce a significant change in basic values. It may 
be, however, that the real impact of such an experience is not 
immediately felt, and that with increased practice in use of a 
problem-solving approach, such values may shift more than can 
immediately be measured. Since no control of outside influences 
seems feasible, no easy way is suggested to discover whether or 
not such a conclusion is warranted. 

Table II presents differences among the subscales of the Allport- 
Vernon-Lindzey Study of Values for the subject group. This is for 
the first administration of the test, which, as has been demon- 
strated above, is not significantly different from the later scores. 
Inspection of the table reveals a definite hierarchy of values for 
the group-as-a-whole. Highest value is that of the Religious, the 
mean score of which is significantly higher than the mean scores 
of the other five subscales. Second in order of importance and ap- 
proximately equal are the values Theoretical and Economic. 
Next in order and approximately equal are the scales Social and 
Political. Finally and least important to the group-as-a-whole is 
the Aesthetic value. To determine whether or not this hierarchy 
of values is representative of teachers-in-general, or is peculiar to 
this particular group, or perhaps representative of teachers in 4 
Southern state or in the Southern region, is not within the scope 
of this study. It is interesting to note, however, that, as compared 
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TABLE II—DIFFERENCES BETWEEN MEAN ScorEs AMONG 
SUBSCALES OF THE ALLPORT-VERNON Stupy oF VALUES 














Subscale (N = 46) Economic Aesthetic Social Political Religious 
Theoretical ll 5.85** 3.09* 3.50** —5.21** 
Economic 5.74** 2.98* 3.39* —5.32** 
Aesthetic —2.76* | —2.35 —11.06** 
Social 41 —8.30** 
Political —8.71** 

















Read the table as follows: The difference in mean scores of the Theo- 
retical scale and the Economic scale is .11. Plus differences are favorable 
to the subscale listed vertically; minus differences are favorable to the hori- 
zontal subscale. 

* Significant at 5% level of confidence. 

** Significant at 1% level of confidence. 


TABLE III—INTERCORRELATIONS BETWEEN SUBSCALE SCORES 
OF THE ALLPORT-VERNON StTupy oF VALUES 











Second Administra- 
Subscale Economic|Aesthetic| Social Political | Religious |tion of Same Subscale 
(at End of Six Weeks) 
Theoretical | —133 | —141 | —501** | —149 | —165 716** 
Economic —325*| —402** | —040 | —199 697** 
Aesthetic 246 —357*| —392** 782** 
Social —098 | —078 435** 
Political —131* 292* 
Religious 558** 




















Read table as follows: Correlation between subscale scores on Theoreti- 
cal and Economic is —.133, etc. Repeat test reliability after six weeks of 
Theoretical scale is .716. Decimals have been omitted. 

* Significant at 5% level. 

** Significant at 1% level. 


with national norms, Southerners, particularly Southern women, 
tend to score high on the Religious subscale.* 

Table III presents intercorrelations between subscale scores of 
the Allport-Vernon-Lindzey Study of Values. All intercorrelations 
are negative, with the exception of that between the Aesthetic- 
Social scales. This correlation of .246 is not, however, significantly 
different from zero. Highest negative correlation comes in the 





*Susan W. Gray. ‘‘A Note on the Values of Southern College Women: 
White and Negro.”’ Journal of Social Psychology, XXXV (1947), 239-241. 
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TaBLE IV—CoRRELATIONS BETWEEN SUBSCALE SCORES OF THE 
ALLPORT-VERNON Stupy or VALUES, WaTsON-GLASER 
CriTICAL THINKING APPRAISAL ScoRES, SUBJECT 
Matter Test Scores, AND AGE 








Watson- Watson- Research 
Glaser CTA,|GlaserCTA,| Subject- Age 

Form A Form B_ /|Matter Test 
Theoretical 161 092 048 —138 
Economic —319* | —305* | —256 —007 
Aesthetic —219 — 200 023 565** 
Social 149 105 368** | —002 
Political 242 198 018 —156 
Religious 049 117 —059 —194 
Watson-Glaser CTA, Form A 604** 636** | —348* 
Watson-Glaser CTA, Form B 551** | —366** 
Research Subject-Matter Test —135 

















* Significant at 5% level. 
** Significant at 1% level. 
Decimals have been omitted. 


relationship between the Theoretical and Social Scales. Correlations 
significant at the one per cent level include the Theoretical-Social, 
Aesthetic-Religious, and Economic-Social. Also given in Table III 
are the correlations between the first and second administrations 
of the tests which essentially constitute repeat test reliability 
estimates after a period of six weeks. These reliability coefficients 
range from a low in the case of the Political Scale of .292 to a high 
of .782 for the Aesthetic Scale. The mean reliability coefficient 
using the z transformation was .61. This reliability coefficient 
is considerably lower than that reported in the Manual of Direc- 
tions of the Test (.82). 

Table IV presents the correlations between the scales of the 
values inventory, the Watson-Glaser Critical Thinking Appraisal 
scores, scores in the subject matter test on research methods and 
chronological age. Most of the correlations between the scales of 
values and the two forms of the Watson-Glaser Tests are insignifi- 
cant. Only in the instances of the relationship between the Eco- 
nomic scale and the two forms of the Critical Thinking Appraisal 
is the correlation significant. The repeat-test reliability, after six 
weeks, of the two forms of the Watson-Glaser Critical Thinking 
Appraisal, is .604, which is considerably lower than the coefficients 
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TaBLE V—Sex DIFFERENCES IN MEAN SCORES ON THE SUBSCALES 
OF THE ALLPORT-VERNON-LINDZEY StuDY OF VALUES AND 
OTHER VALUES 




















Mean Scores 
Difference 
Variable (Male minus 
Male Female Female) 
(N = 22) (N = 24) 
Values 
Theoretical 43.13 39.42 3.71 
Economic 40.05 42.04 —1.99 
Aesthetic 31.68 38.71 —7.03** 
Social 37.50 38 .67 —1.17 
Political 39.41 36.13 3.28 
Religious 47.82 45.13 2.69 
Watson-Glaser CTA, Form A 66.09 60.71 5.38 
Watson-Glaser CTA, Form B 69.36 65.42 3.94 
Research Subject-Matter Test 56 .36 58.17 —1.81 
Age 32.86 39.21 —6.35** 





* Significant at 5% level of confidence. 
** Significant at 1% level of confidence. 


reported in the manual. (These coefficients range from .79 to a 
.95.) A significant correlation between the scores on the Social 
subscale and scores on the subject matter test was found, as well 
as significant correlations between the two forms of the Watson- 
Glaser Critical Thinking Appraisal and the Research Subject 
Matter Test. A highly significant and somewhat surprising correla- 
tion of .565 was found between scores of the Aesthetic scale and 
age. All other values subscales were negatively and insignificantly 
correlated with age. Negative correlations of .348 and .366, both 
significant, were found between the Watson-Glaser Critical Think- 
ing Appraisal and age. 

Table V presents sex differences in mean scores on the several 
variables. Both men and women scored highest in the Religious 
scale and there seemed to be considerable similarity in the hier- 
archy of values between men and women. The only significant 
difference was’in the instance of the Aesthetic scale in which women 
scored substantially higher than men. That this group is an atyp- 
ical one, as compared with the general population, is brought out 
by the fact that the Religious scale, particularly in the case of men, 
is much higher than for the general population. In this particular 
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TaBLE VI—MEAN ScorREs AND DIFFERENCES IN MEANS ON SUBSCALES OF 
THE ALLPORT-VERNON StTupY OF VALUES, AND OTHER VARIABLES OF 


A Group or GRADUATE STUDENTS CLASSIFIED BY AREA OF 
PROFESSIONAL WorK 











Mean Scores Difference in Means 
Values Secon- | Admin. | Admin. 
(N'=10)| (N= 16)] 487, | minus | minus” | Feng minus 
Theoretical 42.4 | 39.3 | 41.2 3.08 1.18 | —1.90 
Economic 37.6 | 43.1 | 42.9 | —5.54"| —5.34* .20 
Aesthetic 36.5 | 38.5 | 32.9 | —2.00 3.65 5.65* 
Social 39.8 | 39.0 | 36.3 81 3.51 2.70 
Political 38.1 | 36.6 | 37.9 1.46 .21 | —1.25 
Religious 45.1 | 43.8 | 48.8 1.26 | —3.74 | —5.00 
Watson-Glaser CTA, | 62.4 | 60.9 | 65.2 1.48 | —2.82 | —4.30 
Form A 
Watson-Glaser CTA, | 67.3 | 65.8 | 68.2 1.51 | —0.89 | —2.40 
Form B 
Research Subject-Matter | 60.4 | 56.8 | 55.1 3.58 5.28 1.70 
Test 
Age 38.5 | 39.5 | 32.7 | —1.00 5.85*| 6.85** 























* Significant at 5% level of confidence. 
** Significant at 1% level of confidence. 


sample, men scored even higher than women on this scale. For 
both men and women, Religious, Theoretical and Economic scales 
were the first three choices, and the Political, Social and Aesthetic 
were the least preferred. Men scored highest on the Critical Think- 
ing Appraisal Test for both forms; however, these differences were 
not significant. Women scored higher on the subject matter test; 
again, this difference was not significant. The difference in mean 
ages between men and women was significant. 

Table VI presents mean records and differences in means among 
the several variables of the group classified by area of professional 
work. The differences in means among the three groups were in- 
significant, except for a significant preference for the Economic in 
the case of the elementary teachers and the secondary teachers, as 
compared with administrators, and a significant preference for the 
Aesthetic for the elementary teachers as compared with the 
secondary teachers. This latter preference may be suggested by 
the fact that most of the elementary teachers were women and a 
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substantial number of the secondary teachers were men. The dif- 
ferences in age between the administrators and secondary teachers 
and the elementary teachers and secondary teachers were sig- 
nificant. 


SUMMARY 


This study has reported mean scores in certain broad categories 
of values as revealed by responses to the Allport-Vernon-Lindzey 
Study of Values and also mean scores on the Watson-Glaser Critical 
Thinking Appraisal of a group of 46 mature graduate students of 
the University of Georgia both before and after an intensive 
course in research methodology. 

An analysis of these scores revealed a hierarchy of values for the 
group with the Religious scale listed as of greatest preference, 
followed by the Theoretical and Economic (on the same level), 
the Social and Political (on the same level), and finally the Aes- 
thetic. No significant differences in pre-course and end-course 
mean scores in values were found. A fairly significant (five per 
cent level) difference in mean scores on the Critical Thinking Ap- 
praisal from pre-course to end-course was obtained. Differences in 
mean scores for men and women for the values inventory were 
found to be insignificant except for the Aesthetic scale, in which 
women made a higher mean score. Other differences (except age) 
were insignificant. 

















THE DAVIS-EELLS GAMES AS A MEASURE 
OF THE INTELLIGENCE OF POOR READERS 


JOSEPH JUSTMAN and MIRIAM ARONOW 


Bureau of Educational Research, Board of Education of the 
City of New York 


For many years, the New York City schools have used the Pint- 
ner Intermediate test as a measure of general intelligence at the 
sixth grade level. Criticism of the relatively high reading load of this 
instrument has led the Bureau of Educational Research to under- 
take a comparative study of several other group measures of in- 
telligence with a view to determining whether a more satisfactory 
test might be utilized. 

One of the tests considered for possible use was the Davis-Eells 
Games, Elementary. Bureau staff members administered this 
measure in nine sixth grade classes drawn from six elementary 
schools. A total of 246 pupils, to whom the Pintner Intermediate 
Test had previously been administered, were tested. Both tests 
were administered within the same school month. This paper re- 
ports the findings of a comparison between the results obtained on 
the two instruments. 

Character of the Sample.—Because the study was concerned pri- 
marily with the appropriateness of the tests for poor readers, the 
nine classes in which the Davis-Eells Games were administered 
were selected to constitute a sample somewhat below the average 
of the sixth grade population at large. The group contained a high 
proportion of third and fourth grade readers who had difficulty in 
reading the Pintner Test and whose general capacity, it was felt, 
was underestimated by the Pintner Test. It was thought that an 
instrument like the Davis-Eells Games, which required no reading, 
would yield higher IQ’s for the group than the Pintner Test, and 
that the mean IQ on the Davis-Eells Games would probably lie 
between 95 and 100. 


THE FINDINGS 


1) Differences in Mean IQ—As a first step in the analysis of the 
obtained data, mean IQ’s and standard deviations were computed 
for the two tests. In addition, the product-moment correlation be- 
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TaBLE 1—MEAN IQ’s AND CORRELATION BETWEEN IQ’s ON Davis-EELLs 
GAMES AND PINTNER TEST 














Test N Mean SD r 
Davis-Eells 246 89.3 13.3 50 
Pintner 246 88.5 14.4 











tween the IQ’s obtained through the administration of the Davis- 
Eells Games and the Pintner Test was determined. These data are 
presented in Table 1.1 

Although the Davis-Eells Games and the Pintner Test show only 
moderate agreement in the relative ranking of pupils, less than a 
one-point difference in mean IQ appears between the two tests. 
This difference is not statistically significant. The mean IQ of be- 
low 90 obtained on the Davis-Eells Games does not conform to the 
expected pattern. 

2) Differences in Individual IQ’s—The mean scores presented in 
Table 1 provide a gross comparison. The differences in the IQ’s 
obtained by individual pupils on the Davis-Eells Games and the 
Pintner Test are summarized in Table 2. 

When the two tests are compared, approximately the same pro- 
portion of the pupils obtained their higher IQ on the Davis-Eells 
Games as on the Pintner Test. It should be noted, too, that 63 per 
cent of the pupils (including 4.1 per cent showing no difference in 
IQ) obtained IQ’s which are within 10 points of each other. On the 
other hand, 10.6 per cent of the obtained IQ’s on the two tests 
differed by more than 20 points. 

8) Differences in IQ as Related to Intelligence Level—A further 
analysis of the data is summarized in Table 3, which presents the 
mean difference in IQ between the Davis-Eells Games and the 
Pintner Test as related to intelligence level on the latter. 

A consistent trend is noted in mean IQ differences between the 
Pintner Test and the Davis-Eells Games as one ascends the Pintner 
IQ ladder. For pupils obtaining Pintner IQ’s between 70 and 79, 
the mean difference between Pintner IQ and Davis-Eells IQ is 7.8 
points, in favor of the latter. In the case of children with Pintner 
IQ’s between 90 and 99, the difference becomes 1.2 points, and is 


1 The courtesy of the Division of Test Research and Service of the World 
Book Company, publishers of the two measures, in providing these data is 
gratefully acknowledged. 
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TABLE 2—NUMBER AND PROPORTION OF PuPILS SHOWING DIFFERENCEs 
IN IQ BETWEEN Davis-EELLS GAMES AND PINTNER TEST 











Higher on Pintner Higher on Davis-Eells 
Difference 

N Per cent N Per cent 
1-10 points 80 32.5 65 26.4 
11-20 points 21 8.5 44 17.9 
21-30 points 9 3.7 10 4.1 
31 or more points 4 1.6 3 1.2 
Total 114 46.3 122 49.6 

















in the opposite direction. At the 100 to 109 Pintner IQ level, the 
Pintner IQ is the higher by 4.5 points. 

In general, the greatest similarity in IQ between the Davis-Eells 
Games and the Pintner Test is noted in the 90 to 99 IQ range. For 
Pintner IQ’s above 100, the Pintner IQ will tend to surpass the 
Davis-Eells IQ; the reverse will tend to hold true for Pintner IQ’s 
below 80. 

4) Differences in IQ as Related to Reading Level—This study was 
undertaken, it will be remembered, as part of a search for an instru- 
ment which would be a more satisfactory measure of the function- 
ing of poor readers. Accordingly, the mean IQ’s on the two tests 
obtained by pupils at varying reading levels were determined. Table 
4 presents a summary of the IQ’s for the 239 pupils for whom grade 
scores on the Metropolitan Intermediate Reading Test, Form T, 
were available. 

It should be noted that by far the largest number of pupils are 
included at reading levels below 5.0. The readability of the Pintner 
Test is much too high for fourth and third grade readers. 


TaBLE 3—MEAN CHANGE FROM PINTNER TO Davis-EELLs IQ’s as RELATED 
TO INTELLIGENCE LEVEL ON THE PINTNER TEST 





Mean Change Pintner to 





Pintner IQ N Davis-Eells 
120 and over 8 —28.4 
110 to 119 15 —11.6 
100 to 109 17 —4.5 

90 to 99 57 —1.2 
80 to 89 77 +2.0 
70 to 79 63 +7.8 








Below 70 9 +10.1 
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TasBLE 4—MeEaAn IQ on Davis-EELLS GAMEs AND PINTNER TEsT As RE- 
LATED TO READING GRADE 








Reading Grade N Davis-Eells Pintner 
8.0 and over 14 102.5 119.1 
7.0 to 7.9 16 102.1 108.2 
6.0 to 6.9 30 94.5 97.4 
5.0 to 5.9 39 91.6 90.0 
4.0 to 4.9 79 86.4 83.9 
Below 4.0 61 81.6 77.1 














At the lower reading grade levels, the Davis-Eells IQ’s, which 
are based upon a test which requires no reading, are only slightly 
higher than those obtained on the Pintner Test. Even at reading 
grades below 4.0, the mean difference is less than five IQ points. 

§) Differences in Distributions—It has been the experience of the 
Bureau that the Pintner Test, when used with the total sixth grade 
population in New York City, places too few pupils between IQ’s 
90 and 109, and too many above and below these limits, particu- 
larly too many below 90. It was felt that the Davis-Eells Games 
would give rise to a distribution which would more closely approxi- 
mate normality. Table 5 presents a percentage frequency distribu- 
tion of the IQ’s obtained by the common group of 246 pupils who 
participated in both tests. 

On the Pintner Test, approximately 30 per cent of the group 
tested obtained IQ’s between 90 and 109, as contrasted with ap- 
proximately 40 per cent on the Davis-Eells Games. The Davis- 
Eells Games thus overcomes the criticism of the Pintner Test that 
the latter places too few children in the average category. 


TaBLE 5—PERCENTAGE DisTRIBUTION OF IQ’s on Davis-EELLS GAMES 
AND PINTNER TEST 








IQ Level Davis-Eells Pintner 
120 and over 1.7 2.9 
110-119 5.5 6.7 
100-109 13.0 5.9 

90- 99 26.9 22.7 
80- 89 31.1 31.9 
Below 80 21.8 29.8 
Total 100.0 99.9 
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At IQ levels below 90, both the Pintner Test and the Davis-Eells 
Games apparently assign too many pupils to this ‘below average” 
category. It seems unlikely that more than half the pupils are below 
average (below 90 IQ) in those aspects of intelligence which the 
Davis-Eells Games purports to measure, or that 60 per cent of the 
group is below 90 IQ, even in the highly verbal type of intelligence 
which the Pintner Test does measure. 

In general, the Davis-Eells Games gives a slightly more favorable 
impression of the capacity of this low-average group of pupils than 
does the Pintner Test. The general distribution of IQ’s on the two 
tests, however, is much the same. 


SUMMARY AND CONCLUSIONS 


As part of a series of studies designed to determine whether a 
more satisfactory measure of the intelligence of poor readers than 
that provided by the Pintner Intermediate Test could be obtained, 
the Davis-Eells Games were administered to a sample of 246 pupils 
drawn from nine sixth-grade classes. A large proportion of these 
pupils were poor readers. The major findings resulting from an 
analysis of the obtained data may be summarized as follows: 

1) Less than a one point difference in mean IQ appears between 
the two measures; this difference is not statistically significant. 

2) Sixty-three per cent of the pupils obtain IQ’s within 10 points 
of each other on the two tests. In general, the greatest similarity 
in IQ between the Davis-Eells Games and the Pintner Test is noted 
in the 90-99 IQ range. 

3) For Pintner IQ’s below 80, the Davis-Eells IQ tends to sur- 
pass the Pintner IQ; the reverse tends to hold true for Pintner IQ’s 
above 100. 

4) The mean Davis-Eells IQ of pupils with reading grades below 
4.0 is only 4.5 points higher than their mean Pintner IQ; the mean 
Davis-Eells IQ of pupils with reading grades between 4.0 and 4.9 
is only 2.5 points higher than their mean Pintner IQ. 

5) The Pintner Test assigned IQ’s below 90 to 62 per cent of the 
group tested; the Davis-Eells Games, to 53 per cent. 

The indications are that there is a high degree of resemblance 
between the results obtained from the use of the two instruments. 
When compared with the Pintner Test, the Davis-Eells Games 
cannot be looked upon as a markedly better measure of the intelli- 
gence of poor readers at the sixth-grade level. 

















ADOLESCENTS’ VIEWS ON ONE ASPECT OF 
THEIR DEVELOPMENT’ 


RUTH STRANG 


Teachers College, Columbia University 


A dominant emphasis in adolescent psychology is on the adoles- 
cent’s concept of himself and perception of his world. Many trends 
focus on this emphasis. Gestalt psychology highlights the im- 
portance of the “situation-as-perceived” by the individual. All 
the writings on the self-concept and ‘“‘self-consistency”’ present an 
individual’s view of himself as a determinant of his behavior. In 
client-centered counseling, the individual is helped to gain an 
understanding of himself and his relationships. In education, too, 
teachers are urged to look below the surface behavior to the mean- 
ing of the behavior of the child. From this emphasis arises the need 
to learn more about the inner world of adolescents and to see 
things through their eyes. 

There are many methods of getting glimpses into how adoles- 
cents are thinking and feeling. Most widely used perhaps are the 
check-list type of opinion survey and the adjustment inventory. 
The questionnaire is a little less suggestive than the check list, 
but still to a considerable extent channels the individual’s re- 
sponses. Something may be learned from informal group discussions 
of adolescent problems. In such discussions and in sociodramas, 
adolescents have expressed some important insights into their 
psychology. Personal documents of various kinds are another im- 
portant source of understanding. In interviews, ranging from the 
casual to the psychoanalytic, some adolescents will reveal a great 
deal of their “private world.” Others, without realizing it, reveal 
themselves in their responses to projective techniques. The slightly 
structured composition combines some of the values of the person- 
ality inventory and the projective method. 

There are individual differences in response to these various 
techniques. Some adolescents are stimulated by the group to think 
freely; others respond best in a face-to-face interview; still others 
welcome the opportunity to express their thoughts and feelings in 





1 Paper read at the 1954 Annual Meeting of the American Psychological 
Association, September 6, New York City. 
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the writing of anonymous compositions. Some adolescents are far 
more suggestible than others and therefore more influenced by the 
items on a check list or the wording of specific questions. 

In this paper, only the responses in freely written compositions 
will be considered. The directions for writing these compositions 
invite introspection on a given subject. If properly introduced they 
evoke frank, thoughtful, and sincere responses. Confidentialness 
is assured. 

Questions will arise, of course, concerning the reliability and 
validity of the data obtained from these relatively unstructured 
compositions. Neither the retest nor the split-half form of re- 
liability seems appropriate to apply to personal documents. A 
repetition of the request for compositions may result in showing 
variation within the individual rather than shortcomings of the 
instrument. “Very few psychologically meaningful properties ever 
really stay put even while the investigator is looking straight at 
them.’ The split-half method obviously is not appropriate to 
data that are in the form of a pattern, instead of a series of discrete, 
comparable items. 

Validity of compositions may be implied in several ways: (1) 
from knowledge of the rapport with the subjects in getting the 
data, (2) from internal evidence of frankness and lack of facetious- 
ness, and (3) from comparison with data of a similar kind obtained 
from group discussions and interviews. The use of the first person 
(“I think”, “I feel’) on the basis of other evidences in the com- 
positions seems associated with the most genuine kind of ex- 
pression. The third person (‘‘adolescents feel...”, “Boys and 
girls often think ...”) seems associated with a measure of reluc- 
tance of the writer to give his own feelings freely. And the second 
person, when used, seems generally to be employed in a short story 
sense (“you should...”) by those who appear to be merely 
repeating lessons that they have learned from parents, teachers 
and other adults. 

Over two thousand compositions of this kind, on a variety of 
topics, were obtained by teachers from adolescents of a wide range 
of ability and socio-economic background in grades seven through 
twelve. Only one of the topics—‘‘How it Feels to Be Growing Up” 





2 Dalbir Bindra and Ivan H. Scheier, ‘‘The Relation Between Psycho- 
metric and Experimental Research in Psychology,’’ The American Psycholo- 
gist, [IX (February, 1954), 70. 
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—obtained from 277 pupils, will be considered in this paper. 
Included in this sample were public high school pupils in a large 
city (70 in number), in a small textile town (22), in a wealthy 
residential suburb (19), in middle-class suburban towns (79), in a 
semi-rural village (14), as well as pupils in a Roman Catholic 
commercial high school (56) and a boys’ military academy (17). 
The teachers presented the topic to the pupils with a minimum of 
structuring of the situation. The pupils wrote the compositions 
during the class period and did not sign their names. 

There are obvious limitations to the data presented. Among 
these are (1) the very small number of compositions from rural 
schools; (2) the lack of supplementary information about the 
individual pupils; and (3) the differences in pupil-teacher relations 
which affect the quality of the responses. The first limitation may 
be quite easily removed in future studies; it is relatively simple 
and inexpensive to obtain these compositions from a large number 
of coéperating teachers. Every teacher who has codperated has 
gained understanding of his pupils quite disproportionate to the 
amount of time spent in reading the papers before they were sent 
to the investigator. The second problem may be solved by giving 
each pupil a number previously identified with the pupil’s name. 
Then the composition may be studied with reference to all the 
other items of information obtained about the individual pupil. 
The third difficulty may be handled by discarding the papers in 
classes where the pupil-teacher relationship was apparently poor 
or the standard directions for writing the composition had not 
been given. The papers usually give clear clues as to whether the 
pupil is writing frankly and sincerely or facetiously. 

Two kinds of information were obtained: (1) common points of 
view and (2) specific psychological insights expressed by individual 
adolescents. The “commonality” aspect is presented in Table I, 
while the individuality in response is represented by the direct 
quotations. 

The common points of view represent the thoughts and feelings 
uppermost in the adolescents’ minds at the time of writing. To be 
sure, there are many other ideas that do not occur to them at the 
moment. Consequently, the meaning of the percentages used in 
this study to describe frequencies of response is not the same as in 
the case of check lists and questionnaires. Both present problems of 
interpretation. The frequency count of the compositions lacks 
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TaBLE I—NuUMBER AND PERCENTAGE OF ADOLESCENTS EXPRESSING 
Pornts oF View Asout GrowinG Up 








Grades 
Viewpoints About Growing Up i185) OO) ta) 





No. % No. % No, % 





Concern about vocation, or | 85 46 31 34 116 42 
about the future 
Feeling of increasing independ- | 68 37 40 44 108 39 
ence and self-direction 
Awareness of increased responsi- | 53 28 42 46 94 34 
bilities 
Concern with social relation- | 37 20 32 35 69 25 


ships 

Concern with boy-girl relation- | 32 17 28 30 60 22 
ships 

Feelings about religion or moral- | 10 5 42 45 52 19 
ity 


Concern with marriage and rais- | 24 13 19 21 43 16 
ing a family 

View of growing up as fun, or 
‘*good’’ experience 

Feeling of frustration that inde- | 23 12 12 13 
pendence is not recognized 

Awareness of increasing accept- | 15 8 14 15 29 10 
ance in the adult world 

Possible reluctance to relinquish 8 4 16 17 24 9 
the security of dependence 

Relation of sports to growing up | 18 10 6 7 24 

Problems of sibling relationships | 19 10 5 5 24 9 


12 21 23 43 16 


& 


13 


& 
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absolute value because many individuals may hold a particular 
viewpoint without expressing it. On the other hand, those respond- 
ing to the check list may mark a particular item with very little 
basis for doing so. In both instances, the nature of the sample will 
affect the frequency of response on separate items. 


COMMON RESPONSES 


Common Responses to the Topic: “How It Feels To Be Growing 
Up’”’.—The tone of these compositions was generally positive and 
expectant. There was little evidence of the severe “strain and 
stress” emphasized by early writers on adolescence. Almost one 
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sixth referred to this period as “fun’’ or “a nice time of life’’-—they 
had a “good” feeling about it. 

Moreover, the data do not bear out the general impression of 
adolescent irresponsibility. They accent adolescents’ desire to be 
independent and to take responsibility. Many of these young people 
thought of growing up as involving increasing independence, self- 
direction, and responsibility. Only a few continued to show some 
reluctance to relinquish the security of dependence. They are more 
likely to have feelings of frustration that their independence is not 
recognized. Some of them were aware of increasing acceptance in 
the adult world. The younger pupils seemed to be more optimistic 
and less realistic than the older group. 

A surprisingly large number of junior high school pupils seemed 
to be concerned about vocations and the future. This high fre- 
quency of reference to vocations may be attributed to their inter- 
pretation of the topic as focused on the future instead of on the 
process of growing up. Another possible explanation is that the 
responses of some of these pupils may have been influenced by a 
unit on vocations recently taught in the junior high school. 

The accompanying table presents, in order of frequency of 
mention, the categories into which the responses were classified, 
together with the number and per cent of responses in each. The 
tabulation was done by Dr. Warren Roome, who read each com- 
position and classified each significant statement under the one 
heading which best described it. The headings evolved from a pre- 
liminary reading of the compositions. 

The sequence of categories from junior to senior high school 
suggests a kind of meaningful progression in the process of growing 
up. As children grow older their feelings of independence and self- 
direction increase. They become much more concerned with re- 
ligious and moral questions and with social relations, especially 
with boy-girl relations and marriage and family life. Their aware- 
ness of increased responsibilities grows. As they grow up physically 
and assume more responsibility they also become increasingly 
aware of their acceptance in the adult world. 

Some variation was noted among the different kinds of schools 
in which the compositions were obtained. For example, the papers 
from the Catholic schools showed a high degree of uniformity. A 
possibly significant concept which seemed to pervade these com- 
positions is that of sudden independence, of almost instantaneous 
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transformation from not-adult to adult, at some specified age— 
either sixteen or eighteen. Possibly also of significance is the 
relatively high frequency with which these pupils expressed either 
a reluctance to leave childhood or fear of adulthood. Their re- 
ligious orientation is also marked. Nevertheless, in these and even 
more in the other compositions unique statements of a genuinely 
personal viewpoint were expressed. 


SPECIFIC RESPONSES 


More valuable in understanding adolescents than the frequency 
of common responses are the psychological insights occasionally 
expressed by individual boys and girls. Those insights have the 
following claims to authenticity: 

1) They are based on the first-hand experience of adolescents 
themselves, not on observation of their behavior by adults. 

2) They depend less on delayed memory than many statements 
about adolescents made by adults. 

3) They express attitudes and feelings directly, whereas most of 
the data about adolescents consists of inferences regarding their 
attitudes and feelings. 

4) They may be culled from a large number of compositions 
written by teen-agers and representing a wide range of age, intel- 
ligence, socio-economic status, family backgrounds, emotional ad- 
justment, geographical location. 

A few of these insights are included here as illustrations. They 
are grouped under five main categories, corresponding to the more 
detailed categories in the table: (1) reluctance to grow up or a feel- 
ing that adults do not understand, or frustration in growing up; 
(2) concern about physical, emotional, social, and other aspects of 
growing up; (3) a somewhat neutral awareness of problems and 
opportunities; (4) feelings of increasing independence, self-direc- 
tion, and responsibility; and (5) satisfaction in growing up and 
awareness of their acceptance in the adult world. The quotations 
have been placed under what seemed to the writer the most appro- 
priate heading, although other points of view may also be repre- 
sented in a single quotation. 

1) Reluctance to grow up, a feeling that adults do not understand or 
frustration in growing up.—As shown in the table a relatively small 
percentage mentioned the feelings illustrated by the following quo- 
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tations. So far as this sampling shows, these attitudes seem to be 
related to the individual personality more than to the age of the 
pupils. 

“Nothing angers me more than to have someone say, ‘Oh, you 
are still young with no problems yet’.” 

“T am expected to have manners like an adult, yet have to go 
to bed early like a little child.” 

“Parents say, “You’re old enough to help out with this,’ and 
‘You can’t do that; you’re much too young’.” 

“More often you wonder what it will be like when there is no 
mother and father to guide you and it scares you.” 

“Growing up is a hard thing to face because childhood is so dear 
to our hearts.” 

“The feeling that comes with growing up sometimes frightens 
you. The thought that now you are responsible for all your actions. 
People can’t say, ‘She is only a child and you can’t blame her for 
her bad actions’.”’ 

“My friends and I have, at one time or another, felt that our 
parents did not understand us, nor care to.”’ 

“Grown-ups say they remember how it was to be growing up: 
that they understand our problems. I disagree, as they forget the 
unpleasant memories; only someone who is going through the 
process knows what it is like to be growing up.” 

“When I finish growing up, which I doubt anyone really does 
completely, I hope I will be able to remember how I felt so that I 
can understand people who are growing up then.” 

2) Concern about physical, emotional, social, and other aspects of 
growing up.—In these quotations, which are typical of a large per- 
centage of the compositions studied, points of views presented by 
psychologists concerning uneven growth and acne, boy-girl rela- 
tions, swings of mood, and alternation between childish and adult 
interests are reinforced. 

“The age of thirteen is a difficult age .... When boys are thir- 
teen they are too old for cowboys and Indians and too young for 
girls. Girls are too old for dolls and too young for boys.” 

“Such disadvantages as legs too long, arms too long, pimples, 
and many other things that make a boy feel very out of place.” 

“Growing up has made me feel different about religion. When I 
try to understand and ask questions, I either get a confusing answer 
or my question cannot be answered.”’ 
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“Growing up in the world today means an uncertain future. It 
means bringing our own children into a troubled world. Waiting 
years for boys who may never return from war results in emotional 
upset for many.” 

“One of the difficulties of teen-agers is their moods. Sometimes 
they get up in the morning feeling wonderful. Instead of talking, 
they feel like singing. Other days they feel wretched and depressed. 
They can hardly drag themselves around .... These moods come 
and go in a mysterious way, for just no reason at all.” 

3) A somewhat neutral awareness of problems and opportunities.— 
Instead of concern about their problems, some of the youngsters 
take an understanding, accepting, and even humorous attitude 
toward them. This insight into the growing-up process is illustrated 
by the following quotations: 

“Parents, it seems to me, worry too much about our welfare. 
But that is simply because they are under a false impression; we 
are grown up long before they have a chance to realize it.” 

“Most girls think about only one thing—boys.” 

“Every time a boy pays a nice compliment, I feel that I’ve grown 
up more. It makes me think that in some way I’ve learned to 
better my appearance.” 

“Some girls when they are quite young are regular tomboys. 
Then all of a sudden, something happens. They become conscious 
of their appearance. They start to take care of their hair. . . they 
start to hang up their clothes ...they begin to notice different 
styles. They wear their first pair of high heels and start to apply 
lipstick.” 

‘My brother is three years older than I am.... Wherever I go 
I no longer am Margaret Kerr but Sam’s sister. My brother and 
I get along swell. Even though he calls me ‘Stinkweed’ I know it’s 
just his peculiar way of showing me affection.” 

4) Feeling of increasing independence, self-direction, and responsi- 
bility. Many illustrations of this point of view might be quoted. 
In fact, in this sampling of adolescents it was the dominant way of 
perceiving the process of growing up. They expressed simultane- 
ously their desire to be free from parental domination and restric- 
tion of childhood and their eagerness for more mature experiences. 

‘‘When a boy reaches his teens he doesn’t like to be led around by 
his mother’s apronstrings and he wants freedom and likes to feel 


self-dependent.”’ 
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“Instead of running from mistakes as a child would, I now see 
the wisdom of facing and conquering error.”’ 

“T like the idea of being able to choose and make decisions for 
myself.” 

“As I grow older I can do more things without my parents 
having to worry about me.” 

“Growing up means... facing problems with almost no help.” 

‘(My problems are now solved by me. I no longer run to Mom 
to solve them unless her advice is absolutely needed.”’ 

“As you grow up you are not so much of a burden to your 
parents, and they do not worry about you but pass on the worries 
to you and you carry your worries for yourself.” 

“When a problem arises, big or small, you can work it out with- 
out getting yourself and everyone around you confused and ner- 
vous.” 

“T am beginning to understand the responsibilities of my posi- 
tion, and most amazing of all, to tackle them.” 

5) Satisfaction in growing up and awareness of their acceptance in 
the adult world. Although their desire for independence is much 
more frequently mentioned than their satisfaction in the process 
of growing up, almost one sixth referred to the rewarding aspects 
of adolescence and one tenth recognized their increased acceptance 
by adults. The following quotations refer especially to the ego 
satisfaction of being able to handle the problems of adolescence 
and being accepted on a mature level. In much of the writing on 
adolescence these positive feelings about growing up are over- 
shadowed by the emphasis on anxiety, insecurity, and “‘problems.”’ 

“Growing up makes me feel as if I were somebody important.” 

“T derive a great satisfaction from the solution of my problems 
and even anticipate the advent of new ones with a curious mixture 
of anxiety and exhilaration.” 

“As you reach your teens you find that people will talk to you 
and let you give your opinions.”’ 

“T am now on eye level with adults and beginning to be accepted 
as &@ person in my own right.” 

The wide range of attitudes toward growing up is represented by 
these two responses: (1) “I think that growing up is one of the 
most exciting things that can ever happen to anyone,” and (2) “On 
the whole growing up is the biggest ordeal anyone goes through. 
I believe I have had more trouble now than I ever had. Disap- 
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pointments and let-downs are the biggest hurdle in growing up. 
One thing I learned to do is never build up my hopes.” 

The statements made by adolescents about themselves are 
appealing; they are directly, simply, and often vividly stated. 
Many of them verify the viewpoints of psychologists about adoles- 
cents. Some of them suggest positive feelings and attitudes that 
perhaps have not been sufficiently recognized in adolescent litera- 
ture—(1) the satisfaction that comes from successful accomplish- 
ment of adolescent tasks, (2) the recognition that responsibility 
should accompany freedom, (3) their eagerness to assume re- 
sponsibility, and (4) their insight into adults’ points of view. Most 
of all, these compositions alert us to individual differences among 
adolescents and the diversity of viewpoints and feelings which they 
express. 











EFFECTS OF VARIATIONS IN TEACHER ROLE 
BEHAVIOR ON THE STUDENT-TEACHER 
RELATIONSHIP 


MARTIN M. GROSSACK 
Philander Smith College, Little Rock 


The concept of social réle has become one of the most widely 
used explanatory tools in social science. Parsons et al. have theo- 
retically described several related dimensions of social roles.' In 
western society, réle expectations differ significantly in the areas 
of formal organizational behavior in comparison with the demands 
of kinship and friendship situations. Large organizations, such as 
educational systems, theoretically require behavior that is uni- 
versalistic, non-emotional and specific in function. On the other 
hand, situations of kinship and friendship carry with them rdéle 
expectations of particularistic orientation, emotional relationships 
and diffuseness of function. 

An example will illustrate the differences inherent in the two 
types of situations. The teacher is expected to treat all students 
alike regardless of whom they are (universalism), show no emo- 
tional involvement in their conditions or towards them as in- 
dividuals (there is, for example, a general taboo on student-faculty 
dating), and his authority is specifically limited to problems of 
teaching (he has no power over student’s religious or private be- 
haviors as long as they do not affect the behavior of the student 
as a student). 

The experiment reported here is concerned with the applica- 
bility of this conceptualization to the student-teacher relationship. 
Three predictions were derived for empirical test: 

1) Emotional ties will develop more frequently in conditions 
of particularism than in those of universalism. 

2) Respect of status differences will be less frequent in condi- 
tions of particularism than in those of universalism. 

3) Barriers to non-relevant communications will be greater in 
conditions of universalism than in those of particularism. 





1T. Parsons and E. A. Shils, Eds. Towards A General Theory of Action. 
Cambridge, Massachusetts: Harvard University Press, 1951. 
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PROCEDURE 


Eighty-one Negro Ss were made available to E during a meeting 
of students. Ss were randomly given one of the following mimeo- 
graphed characterizations: 

A. Particularistic Form.—Professor Jones was brought up in a 
small Midwestern town. Then he attended a small college. As a 
teacher he tends to form close friendships with his students and gets 
emotionally involved in his teaching and his relations with students. 

It is known that Professor Jones takes part in many social ac- 
tivities with students and calls them by their first names. He shows 
some preferences in his relationship with students and possibly 
treats a few better than he does the rest. 

B. Universalistic Form.—Professor Jones was brought up in a 
big city and attended a large Midwestern University. As a teacher, 
he is very careful to keep his distance from any close friendships 
with students. He just teaches his courses without emotional in- 
volvement in relations with his students. Outside of class, he has 
as little as possible to do with his students. He makes it a practice 
to treat all students alike and calls them Mr. or Miss, rather than 
by their first names. 

This procedure, in effect, presented two ideal-types and assumed 
there would be differences in response to them. Each § received 
the same fourteen-item questionnaire. 


RESULTS 


The differences in questionnaire responses by recipients of the 
particularistic and universalistic teacher characterizations are 
shown in the table. ‘ 

1) Emotional ties hypothesis.—The prediction that emotional ties 
will develop more frequently in conditions of particularism than 
in those of universalism received some support from the results 
in Table 1. Ss were more likely to feel friendly towards the par- 
ticularistic characterization (P.10) and slightly, but not signifi- 
cantly, more likely to answer items 2, 4, 8, 11, and 14 favoring it. 

2) Respect of status hypothesis.—The prediction that respect of 
status differences will be less frequent in conditions of particularism 
than in those of universalism was supported. Ss were more likely 
to respect the universalistic characterization and feel he grades 
fairly (P.10) and slightly less likely to consider him a “‘small guy”. 
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TaBLE I—DIFFERENCES IN QUESTIONNAIRE RESPONSES CONCERNING 
TEACHERS WITH PERCEIVED PARTICULARISTIC AND 
UNIVERSALISTIC ROLE BEHAVIOR 





Characterization 








Question Particularistic | Universalistic x? P 
tney| Nt, | une | 38 





How likely would you be to: 
1. Tell Professor Jones 29 17 13 20 3.30 .10 
your personal problems? 
2. Attempt to develop aj| 37 8 23 10 ° ° 
friendship with him? 
3. Joke with him? 
4. Take a course with him? 
5. Think he is a small guy? 
6. Think he deals smoothly 
with people? 
7. Tell him your disagree- | 39 8 21 12 2.90 10 
ments with him? 


12 9 24 13.50 .001 
12 19 13 ° ° 
20 12 19 ° ° 
15 22 ll ° ? 


SRES 


8. Think he has a healthy | 33 14 19 13 . ° 
personality? 

9. Feel friendly towards | 41 4 20 8 3.13 10 
him? 

10. Think he grades stu- | 27 18 27 6 3.31 .10 
dents fairly? 

11. Think he likes the stu- | 39 7 26 7 . S 
dents? 

12. Learn a lot in his class? | 29 16 22 10 ° ° 

13. Respect him? 39 7 32 0 3.63 .10 

14. Feel he knows his stuff? | 33 12 25 11 a ° 























* Not significant. 


3) Barriers to communication hypothesis.—The results suggest 
that universalistic réle behavior will serve as a barrier to non- 
relevant communications. Ss would be less likely to joke with the 
universalistic professor (P.001) or to tell him their personal prob- 
lems (P.10). However, the formality of the réle relationship might 
also serve to minimize the amount of disagreements that are ex- 
pressed overtly to the universalistic authority figure (P.10), sug- 
gesting that the particularistic individual might less likely have 
people griping without his knowledge. 
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DISCUSSION OF FINDINGS 


The implications of these results for educational psychology 
merit consideration. Although the statistical evidence is suggestive 
rather than conclusive, the social psychological factors in the 
student-teacher relationship have received scant empirical atten- 
tion from educators; therefore, any evidence merits attention. 

First, it is important to raise the question of the inherent incom- 
patibility of the two types of réle orientations. Do students having 
close friendships with teachers believe that they receive the same 
impartiality in grading as non-friends? Theoretically, such impar- 
tiality is not possible for friendship relations. Can the teacher who 
judges students’ performance really give emotional support to all 
class members? It would seem that the réle of teacher as teacher 
should be limited to non-therapeutic activities. 

The results show that students will more readily develop positive 
affect towards particularistic teachers and more readily confide 
in them. Impersonal teachers, that is, those who deliberately take 
a universalistic orientation, have to realize the necessity for creating 
a congenial working climate to offset the formality of their réle 
behavior. Possibly, it is best to encourage the students to feel 
friendly and confide, but good sense dictates that the teacher 
should not reciprocate to the extent of an emotional involvement 
that could disrupt his réle behavior as teacher. 

Réle-playing should probably receive more attention in teacher 
training programs so that some of the conflicts inherent in the 
réles of teacher can be anticipated by future teachers. A body of 
reliable knowledge is necessary and urgent if our knowledge of how 
to teach and to relate to students can be based omevidence rather 
than tradition. 


SUMMARY 


From Parsons’ theoretical description of particularistic and uni- 
versalistic dimensions of réle behavior, two ideal characterizations 
of college professors, one exemplifying the former orientation, the 
other the latter, were presented to student Ss. Differences were 
found in the perceptions and in the expected communications 
students would make to professors of each type. Evidence was 
presented supporting hypotheses of fewer emotional ties, more 
respect of status differences, and more barriers to communication 
towards the universalistic characterization. 











TEACHER REACTIONS TO CONFLICT 
SITUATIONS 


SAMUEL I. SPECTOR 


Center for Human Relations Studies, New York University 


In the practice of human relations, since every individual is 
motivated by his inherent and acquired frame of reference, per- 
sonal and therefore varied responses are made to given stimuli. 
In the general scope of interrelationships in which each individual 
does not have continued daily concentrated contact with one or 
more fellow human beings, the impress of his frame of reference 
may be transitory. In other words, a mood or attitude may be 
evoked during the contact, but it is likely to disappear when the 
stimulus is removed. However, when there are daily meetings as in 
teaching, continual exposure to human relations conflicts, especially 
as related to supervisor, teacher, and pupil interaction, presum- 
ably leaves a more permanent impression. Here, success in human 
relations cannot be made dependent on the hope that somehow 
they will muddle through. Since the full development of children 
is involved, there is too much at stake. And, further, since in this 
development, the teacher is the focal point, his ability to interact 
successfully becomes a matter of prime importance. With this 
thought in mind, the study described below was initiated. 

At the outset these questions arose: What conflict situations 
would be common to the majority of teachers? What would be the 
measure of behavior responses? Would teachers react uniformly or 
diversely to a given conflict situation? Would they show a lag be- 
tween their concept of what was the right thing to do, what was the 
practical thing to do at the moment of conflict, and what they 
actually did? What evidences of authoritarian behavior would be 
apparent? What are the implications of the uniformity or diversity 
of response for teacher training? 

It was felt that a questionnaire might be the suitable instru- 
ment, since there was no intention to probe deeply into individual 
characteristics. The purpose of the study was to determine how a 
number of teachers would react to the same stimuli. On the basis of 
talks with supervisors, teachers and pupils, conflict situations were 
determined. There were five for each relationship (teacher-super- 
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visor, teacher-teacher, and teacher-pupil), classified as to origin 
(self initiated, initiated by other individuals, and initiated by cir- 
cumstances). The subjects were asked to state what they would do, 
should do, or did in terms of suggested courses of action listed by 
letter and number, in the situations outlined. Thus in the teacher- 
supervisor relationships, the following were posited: 

1) Your principal is interested in a certain philanthropy. He 
wants all teachers to subscribe. You do not care to participate. 
He presses you daily. 

2) You have invited your supervisor to observe you. He visits 
for an hour and leaves behind a very uncomplimentary report. He 
refuses to discuss the write-up with you. 

3) The supervisor is riding a weak teacher. You cannot tolerate 
this treatment and intervene with the supervisor. He tells you to 
mind your business and plainly shows his resentment at your inter- 
ference. 

4) The supervisor suggests a certain teaching procedure. You 
know that it will not work in your class. He claims superior knowl- 
edge and insists you use his suggestion nevertheless. Tempers flare. 

5) As a disciplinary measure, an older child from a higher grade 
has been placed into your class. He gets into a quarrel with one of 
your pupils and injures him severely. The principal holds you com- 
pletely responsible for the difficulty. 

Numbers 1 and 2 represent situations initiated by other individu- 
als, numbers 3 and 4, those that were self initiated, and number 5 
a situation arising from circumstances. 

In the teacher-teacher category, the following situations were 
suggested : . 

1) You have been accused of spreading gossip about another 
teacher. You are entirely innocent. The teacher confronts you and 
berates you. 

2) The teacher in the adjoining room is out of her room often 
during the day. At these times, the noise from her class disturbs 
your work. You remonstrate with the teacher. She tells you to stop 
being a goody-goody and kill-joy. 

3) A coarse and shallow fellow teacher, whom you do not like, 
seeks your friendship. He goes out of his way to meet you with his 
attentions and small gifts. Despite all your efforts to discourage 
him he keeps on trying. 
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4) There is a coveted assignment that you want, but that should 
go to a fellow teacher because of his seniority. You pull strings and 
get the assignment. Whenever he sees you he shows his resentment 
by his looks and remarks. 

5) You have been away from school for two weeks because of ill- 
ness. The school Social Committee, which usually recognizes such 
absences by a gift, has failed to do so in your case. Upon your re- 
turn, the committee lamely and weakly mentions the oversight. 

Numbers 1 and 3 represented initiation by other individuals, 
2 and 4 initiation by the teacher, and number 5 initiation by cir- 
sumstances. 

The third group of situations concerned teacher-pupil interac- 
tions. They were as follows: 

1) You are irritable. A child drops a book accidentally. You 
berate her. Stung by your injustice, the child glowers and mutters 
under her breath. 

2) While the supervisor is present, the smart-alec in your class 
decides to show off. In answer to your every gesture and silent 
plea, he shows off still more. He mimics you. 

3) One of your pupils comes to school in ragged clothing. You 
obtain some good second-hand clothing, and, in privacy, offer him 
better clothing. The child bursts into bitter resentful tears and runs 
out of the room. 

4) One of your pupils seems to show an instinctive dislike for 
you. He goes out of his way to irritate and annoy you. His special 
pleasure seems to be to mispronounce your name purposely. 

5) You are on duty in the yard. Some large boys and girls from 
the graduating class are fooling around. They do not see you ap- 
proach. When you scold them they get fresh, not realizing that you 
are a teacher. The boys, to save face, try to act tough with you. 

Here, too, the genesis is self initiation for numbers 1 and 3, other 
initiation for numbers 2 and 4, and circumstance initiation for 
number 5. It should be noted that each of the fifteen situations 
was a part of the writer’s personal or vicarious experience. 

The formulation of a continuum of courses of action was the 
result of many castings and recastings in the effort to graduate 
behavior patterns ranging from the immature physical exchange 
of blows to the objective attempt to determine the cause of irrita- 
tion. Some degree of continuum was obtained, but exclusiveness 
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of the various gradations was difficult to achieve. The courses of 
actions are as follows: 


a.l. Exchange blows 2. Threaten with fists 3. Curse vocally 
b.1. Talk back 2. Bluster 3. Threaten 
4. Curse under the 
breath 
c.l. Laugh it off 2. Placate by smiling 3. Be coldly »olite 
4. Use soft words 
5. Divert attention to 
something else 
d.1. Remain calm and 2. Ignore 3. Walk away 
silent 
e.1. Feel annoyed 2. Feel resentful 3. Feel hurt 
f.1. Discuss calmly 2. Stick to the facts of 3. Try to determine the 
the situation cause of the irritation 


The fact, that except for a.1 and a.2, all courses of action were 
selected by one or more subjects, seems to give some validity to the 
continuum. 

The format of the questionnaire deserves some description. It 
consisted of two pages, each headed by the list of courses of action. 
At the very outset, the purpose of the questionnaire was stated in 
these words: 


‘‘The purpose of this questionnaire is to determine how teachers feel 
they should and would act and how they did act in real or possible situa- 
tions. Alongside each situation is placed a form containing three blank lines 
for the three possibilities—you should do; you would do; you did. From the 
list of possible courses of action listed below, you are asked to select the 
letter and number of the action you would choose to suit the situation; and 
to place them in the respective blank lines representing the possibilities.”’ 


On the left side were set three columns of possibilities, and on the 
right the five situations. The example below is indicative. 


Possibilities 
You You 
should would You 
do do did Situation 





1. Your principal is interested in a certain philan- 
thropy. He wants all teachers to subscribe. You do 
not care to participate. He presses you daily. 

2. You have invited your supervisor to observe you. 
He visits for an hour and leaves behind a very un- 
complimentary report. He refuses to discuss the 
write-up with you. 





The same format was repeated on the second sheet for teacher- 
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teacher and teacher-pupil situations. At the bottom of this sheet, 
information was requested about the age of the subject (above or 
below 40), the number of years’ teaching experience and the 
number of schools in which the subject had taught. Age was 
asked to determine whether a relationship existed between this 
factor and the course of actions selected. The same rationale 
motivated the question on teaching experience. Finally, there was 
the possibility that teaching in a great number of schools indicated 
an instability, which would be represented in choices of more im- 
mature courses of action. Signatures of the teachers were not 
solicited so that free and uninhibited choices might be made. 

The questionnaire was administered to fifty-three teachers from 
five different schools, servicing the same type of lower middle- 
class population. There was no attempt to differentiate between 
the sexes. Of these teachers, eleven indicated that they were below 
forty years of age, and twenty three that they were above that 
age. The average number of years of teaching experience was 
eigateen. The average number of schools taught in was three. 
Careful examination of the choices made did not indicate a positive 
relationship between age, experience, or number of schools and 
the selection of courses of action. 

In terms of situations, it appears that the greatest number of 
choices for any of them did not exceed fifty per cent. This means 
that, for the most part, fewer than one half of the teachers made 
the same choice of a course of action—a great diversity indeed. 
This variety of response is indicated by the following breakdown, 
in terms of the most selected behavior reaction f.1 or “discuss 
calmly”’. 


TEACHER-SUPERVISOR RELATIONSHIPS 
Should Do Would Do Did 


Other-initiated situations 28% 19% 20% 
Self-initiated situations 40% 20% 20% 
Circumstance-initiated situations 50% 40% 35% 


(Reap: 28% of all teachers in other-initiated situations chose the f.1 
category, etc.) 


At this point, the decrease between the “should do” and “would 
do” possibilities should be noted. This seems to indicate a lag 
between values (should do) and the application of these values 
(would do). Of interest, too, is the greater claimed (by the sub- 

















442 The Journal of Educational Psychology 


jects) ability to apply calm discussion to circumstances—initiated 
comlict situations. At least half of the teachers could not agree 
upon one course of action. 


TEACHER-TEACHER RELATIONSHIPS 


Should Do Would Do Did 
Other-initiated situations 22% 24% 30% 
Self-initiated situations 31% 20% 33% 
Circumstance -initiated situations 12% 1% 1% 


(Reap: 22% of all teachers in other-initiated situations chose the f.1 
category, etc.) 


Here again a lag between values and their application is evident, 


though to a much lesser degree. It is especially apparent in self- 
initiated situations. At least two-thirds of the teachers could not 


agree upon one course of action. 


TEACHER-PuPpiIL RELATIONSHIPS 
Should Do Would Do Did 


Other-initiated situations 138% 12% 20% 
Self-initiated situations 25% 20% 25% 
Circumstance-initiated situations 27% 21% 31% 


(Reap: 13% of all teachers in other-initiated situations chose the f.1 
category, etc.) 


The lag between should-do and would-do choices continues in this 
category also. Again at least 69% of the teachers could not agree 
upon one course of action. Along with Teacher-Supervisor Rela- 
tionships it shows the greatest choice agreement in circumstance- 
initiated situations. 

From another point of view, these statistics are of interest in 
illustrating diversity of responses. Of fifteen possible responses in 
the Teacher-Supervisor category, 9 or 60% were indicated for 
behavior pattern f.1 (discuss calmly), 3 or 20% for f.2 (stick to 
facts), while the rest were scattered. In the Teacher-Teacher rela- 
tions, of the 15 possible responses 7 or 46% were indicated for f.1 
(discuss calmly), 3 or 20% for c.2 (placate by smiling), while the 
rest were scattered. In the Teacher-Pupil interactions, of 15 pos- 
sible responses 5 or 34% were indicated for f.1 (discuss calmly), 5 or 
34% were indicated for d.2 (ignore) and the rest were scattered. 
It is interesting to note the progression in the f.1 choices—34% 
for Teacher-Pupil relationships, 46% for Teacher-Teacher rela- 
tionships and 60% for Teacher-Supervisor relationships. It is 
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heartening to note that so many teachers feel that they can discuss 
conflict situations calmly with their supervisors. 

Another breakdown in terms of greatest choices indicates the 
following choices of courses of action: 


CoursEes or ACTION 


Teacher-Supervisor b.1 dl d2ele2e3 f1 {2 {3 
Teacher-Teacher c.l c3 dl d2 e.l 8 & @ 
Teacher-Pupil c2 c4 ef dl d2 el f.l {3 


Here the majority indicated the following reactions: d.1 (remain 
calm and silent); d.2 (ignore); e.1 (feel annoyed); f.1 (discuss 
calmly) and f.3 (determine cause of irritation). In Teacher-Pupil 
relations only 2 of 15 possible f.3’s were indicated. If this reaction 
is most desired, the teachers are sadly wanting. The emphasis on 
f.1 (discuss calmly) does not presume constructive discussion. The 
choices of c.4 (using soft words) and c.5 (diverting attention) in 
Teacher-Pupil relationships is of interest, but they represent less 
than 27 % of the total responses. This is also true for the c.3 (being 
coldly polite) choice in Teacher-Teacher relations. Here for Situa- 
tion 3, as much as 43 % of the choices were involved. 

In terms of what the teachers thought they should do and what 
they would do, there are interesting comparisons. In Teacher- 
Supervisor relations, 42 % of the teachers said that they should dis- 
cuss Situation 1 calmly, but only one half of this number felt they 
would do so. Approximately the same response was made for 
Situation 4. Fifty per cent felt that they should and would discuss 
calmly Situation 5. Further in Teacher-Teacher relations, 42% 
declared that they should discuss calmly Situation 1, while 47% 
felt that they would do so. For Situation 2, the comparable per- 
centages were 42 and 30. For Situation 3, about 40% said they 
should be coldly polite, while 50% said they would be so. Finally 
in Teacher-Pupil relationships, 25% of those answering said they 
should and would ignore in Situation 1. This was also true for 
Situation 5. In Situation 3, 34% said they should discuss calmly, 
while 29 % said that they would. In general, it would seem that in 
all categories, Situations 1, 3 and 5 received the greatest number of 
choices. All of the foregoing attests to the diversity of choice of 
behavior patterns. 

On the premise that individuals who made “a” and “b’’ action 
choices (exchange blows to curse under breath) were immature 
and authoritarian personalities, fifteen questionnaires containing 
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such responses were analyzed in detail. It was determined that of 
these 15, 6 were over forty years old and 3 under this age. This 
compares with 23 and 11 for the total population. Six did not give 
their age, a possible indication of suspiciousness. The average 
teaching experience of 10 of the 15 (5 did not answer) was 18 years, 
the same as for the total population. The proportion for number of 
schools taught in also remained constant. It is thus evident that 
in the study, age experience and number of schools taught in were 
not affective factors in the authoritarianism of the personalities. 
In the range of choices for all categories (Teacher-Supervisor, 
Teacher-Teacher, Teacher-Pupil), a.1 (exchange blows) and a.2 
(threaten with fists) choices were absent. All other behavior reac- 
tions were included. The a.3 action (curse verbally) was indicated 
as a “would” choice by one teacher for Situation 4 in Teacher- 
Pupil relationships and by another teacher for a “‘would” choice for 
Situation 2 and a “should” choice for Situation 4 in Teacher- 
Supervisor relationships. 

Authoritarianism showed itself most in Teacher-Pupil interac- 
tions when 13 “did” responses, chiefly in the “threaten” behavior 
pattern, were indicated. It appeared less in Teacher-Supervisor 
relationships, again with no duplications, and not at all in Teacher- 
Teacher relations. In the Teacher-Supervisor category, the chief 
reaction chosen was ‘“‘talk back’. The same was true for the 
Teacher-Teacher category. In the Teacher-Pupil category, threats 
and talking back were made chiefly for Situations 2 and 5; in the 
Teacher-Teacher category talking back was indicated for Situa- 
tion 1; while in Teacher-Supervisor relations, talking back was 
chosen for Situations 4 and 5. In all three categories, the teachers 
were able to tell most readily what they would do as indicated by 
a 91% response, less readily what they should do as indicated by a 
75% response, and least readily what they did as indicated by a 
63 % response. The teachers showed the greatest frustration in #5 
Situations, those that were caused by circumstances beyond their 
control. In the Teacher-Pupil category, the great choice of Situa- 
tion 2 which involves the presence of a supervisor is interesting for 
its interactive possibilities, ie., fear of or discomfort with the 
presence of the supervisor made the security of the teacher less 
stable and helped project this feeling into the reaction toward the 
pupil. The large choice of Situation 4 in the Teacher-Supervisor 
category, which involves a self-initiated dispute about the super- 
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visor’s asserted superior knowledge of teaching procedure, indi- 
cated authoritarian personality traits in that the teachers felt that 
their knowledge and status were under attack. Many of the teach- 
ers showed immature behavior when accused wrongly by a fellow 
teacher. Fifteen out of 53 cases, a percentage of 30, showed evidence 
of immature behavior. Finally, it appeared that the greatest num- 
ber of choices over the widest gamut of choices indicative of au- 
thoritarian behavior patterns, was to be found most in Teacher- 
Pupil relations, less in Teacher-Supervisor interactions and least 
in Teacher-Teacher relationships. 

What, now, are the implications of the study, especially for 
teacher training? In a pilot study of this kind, based on so small 
a sampling, no definitiveness is possible. However, certain trends 
are apparent. If the teachers studied are representative, and more 
than half of them did not react in the same manner to a variety of 
conflict situations, what would be the effect on the emotional 
growth of our children? Further, to what extent will this disparity 
in behavior reaction between teacher and supervisor and fellow 
teacher in turn affect teacher-pupil relationships? Is the lag be- 
tween what the teachers felt they should do and what they 
indicated they would do, an indication of the failure of the teacher- 
training program? Is it the business of the college to develop in 
embryo teachers an emotional objectivity that would make it 
natural for them to try to determine the cause of irritation in cases 
of conflict? These questions merely scratch the surface of possible 
implications. However, what does arise from the study is the need 
for teaching concrete human relations courses in the various teacher 
training institutions. These would involve the resolution of major 
individual emotional problems or weaknesses through mental 
hygiene courses, group therapy or even individual psychiatric 
attention. They would include the establishment of clear, cogent 
values in education and democracy, made so vital, that the lag 
between their conception and application would be reduced to a 
minimum. Finally, they would provide for the analyses and treat- 
ment of concrete human-relations conflict situations in all phases 
of the teacher training, from the philosophy of education course to 
student teaching. This is the meat upon which good teacher tyros 
must feed! 











Book Reviews 


Doris M. Opium. Psychology, The Nurse and the Patient. New 
York: Philosophical Library, 1952, pp. 168. 


How psychology has helped and can help nursing in England is 
what the book is about. It is a revised edition of a syllabus used 
for preliminary and final examinations for State registered nurses 
in London. The author has played the réle of physician, psychia- 
trist, lecturer to nurses and also of patient, and thinks she has 
gained enough insight from all of this to transmit to others. The 
material is presented in nineteen short chapters. The more prac- 
tical ones which will more likely make sense to the American reader 
are the chapters on the Patient, the Hospital and the Patient, the 
Nursing of Children, Treatment of Psycho-Neuroses. The theoret- 
ical considerations of the kind of psychology that can be helpful to 
nursing will be puzzling to some American psychologists and ques- 
tionable to others. The chapter on the How and Why of Human 
Behaviour and Inborn Mental Factors more or less accepts the 
McDougall list of instincts and supplements it with other more 
generalized lists. Here are given as ways of helping nurses a list of 
instincts and emotions: flight with fear, repulsion with disgust, sex 
with tenderness, etc. Many facts are presented here as if the last 
answer was in, although there are still plenty of question marks on 
them. Maybe it’s advisable for a nurse to just do what is being done 
or told to do rather than ask many questions abdut the limitations 
of some of the treatments listed here. For general purposes, there 
is no doubt that this volume has served and can serve English 
nurses in training. For more reliable knowledge of what psychology 
has to teach about human behaviour that could be of use to nurses, 
another book can be written. H. MELTzER 

Psychological Service Center, St. Louis, Mo. 


Tae Kentucky Symposium. Learning Theory, Personality Theory, 
and Clinical Research. New York: John Wiley & Sons, 1954, 


pp. 164. 


At the initiative of the Psychology Department of the University 
of Kentucky and with the support of the University administration, 
a very worthy effort was made to bring together a number of psy- 
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chologists for the purpose of clarifying and bringing integration 
into the points of view and the theories relating to the three areas 
indicated in the title. The Kentucky Symposium was conducted 
and later found publication in this volume under the direction of 
Dr. Frank A. Pattie. 

The subjects with their authors are as follows in their respective 
order: Current Interpretations of Learning Data and Some Recent 
Developments in Stimulus-Response Theory (Kenneth W. Spence); 
Stimulus-Response Theory as Applied to Perception (Selos D. 
Wickens); Motivational Forces Underlying Learning (Harry F. 
Harlow); The Premature Crystallization of Learning Theory (Nor- 
man R. F. Maier); Learning and Explanation (Donald K. Adams); 
Ego Psychology, Cybernetics, and Learning Theory (O. H. Mow- 
rer); Personality Structures as Learning and Motivation Patterns 
—A Theme for the Integration of Methodologies (Raymond B. 
Cattell); Prospects and Perspectives in Psychotherapeutic Theory 
and Research (John M. Butler); Learning: an Aspect of Person- 
ality Development (Donald Snygg); Errors: Theory and Meas- 
urement (R. B. Ammons), etc. This kind of collection may be of 
great value to psychology. It should help very much in the devel- 
opment out of the ‘psychologies’ into the psychology which is not 
yet quite in the thinking of many workers in psychology who are 
still too limited and too sectarian. 

Careful reading of this symposium may emphasize the fact that 
some theorizing has been premature, that much more experimental 
and clinical data are needed, but, on the other hand, that theoret- 
ical considerations have not only been necessary for proper investi- 
gation, and that more comprehensive theories are necessary for the 
further development of psychology. 

The study has the advantages and disadvantages of multiple 
authorship, but the former outweighs the latter; the reader can 
better evaluate the present situation after considering the critical 
comments introduced by the writers. There is perhaps as much 
agreement as could be expected in the present development of the 
science. We are having the difficulties of terminology well known 
to the other sciences. We are dealing with exceedingly complex 
problems. We have probably succeeded in thinking up more points 
of view and modes of approach than any other science. And we 
have certainly not failed to fall back on the resources of the Eng- 
lish language in such ways as have seemed to many too compli- 
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cated and even at times pedantic. Considering the complexity and 
difficulty of the task undertaken, this volume can be thought of as 
clear, direct, not too technical, and not pedantic. 

The special recommendation for the book, is, the reviewer thinks, 
in the success it has achieved in the attempt to relate theoretically 
several fields or areas of psychology, and the stimulus it should 
give to better thinking and investigation in terms of a wider, more 
comprehensive view. With this kind of progress in psychology, no 
one need despair of the science except he who has hedged himself 
in with too narrow and puristic a definition, and has found that 
psychology has already burst the bounds of his definition and gone 
on like a living organism to become what was not dreamed of in the 
ancient armchair. 

The reader will find very frank, honest, sometimes devastating, 
but also constructive, evaluation of theory and theorizing in psy- 
chology. Some psychologists at least are willing to bring out the 
shortcomings of psychology and of psychological workers (shall we 
say especially of other psychologists?) but are as eager to point out 
the possible ways of improvement as far as they are understood. 

As these authors have disagreed with one another, so the reader 
can find his particular points of disagreement, but as a total con- 
tribution, he should find an excellent critical evaluation of many 
psychological matters that will well repay his careful study. 

A useful Index (pp. 161-164) concludes the book. 

A. 8S. Epwarps 
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The University of Georgia 
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